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ABSTRACT 


VERIFICATION  AND  CORRECTION 
OF  NON-PROCEDURAL  SPECIFICATIONS 
IN 

AUTOMATIC  GENERATION  OF  PROGRAMS 

by  S.  K.  Shastrv 
Supervisor:  Prof.  N.  S.  Prywes 

This  dissertation  deals  with  the  design  and 
implementation  of  an  interactive  Automatic  Ptoqram 
Generator,  which  will  qenerate  PL/1  programs  from  a 
non-procedural  description  of  a problem  in  MODEL  (Module 
DEscription  Lanquaqe) . MODEL  has  many  characteristics  of 
very  high  level  languages,  beinq  at  the  same  time, 
descriptive.  MODEL  processor  is  quite  tolerant  of  errors 
in  a user  specification  (like'  incompleteness, 
inconsistencies,  and  ambiguities),  and  resolves  those 
errors,  either  by  qeneratinq  additional,  statements , or  by 
updating  the  user  supplied  statements. 

A special  form  of  gtanh  called,  "Array  Graph",  is  used 

1 

to  represent  the  MODEL  specification.  Informally,  an 
array  graph  is  a compact  representation  of  a conventional 
directed  graph.  Conventional  qtaph  algorithms  are  applied 
directly  to  the  array  graphs,  and  in  particular,  a 
necessary  and  sufficient  condition  for  "sequenceability" 


of  3itay  qraohs  is  derived. 

Atiay  qiaph  representation  has  been  found  auit.e  useful 
in  analyzing  MODEL  soecif ications  consistim  of  iterations 
and  recursions. 
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CHAPTER  1 


INTRODUCTION 

1.1  Problem  of  Automatic  Progt amming. 

For  a long  time  it  has  been  realized  that  technological 
advances  in  hardware  have  far  outstripped  those  in 
software.  In  addition,  because  each  software  must  be 
individually  handcrafted  by  a programmer,  the  cost  of 
software  is  rising.  To  reduce  this  problem,  several 
attempts  have  been  made  to  automate  software  systems.  As 
a result  a variety  of  compilers,  hiqher  level  languages, 
automated  aids,  etc.,  have  emerged.  These  tools  have 
served  as  a useful  aid  to  the  programmers,  but  are  not 
enough  to  simplify  the  software  production  problems  of 
today. 

The  research  reported  in  this  dissertation  is  aimed  at 
solving  the  rising  problems  of  software  development  by 
automating  some  phases  of  the  software  development 
process.  This  is  achieved  by  developing  a simple 
man-machine  interface,  in  which  the  user  describes  a 
problem  in  a simple  descriptive  language,  without  direct 
reference  to  a particular  computational  technique,  or  a 
particular  data  operation.  The  interface,  in  turn, 
generates  a program  for  the  problem  in  a high  level 
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language,  after  resolving  the  euois  in  the  user 
specification.  This  interface  is  quite  different  from  the 
conventional  lanquaqe  processors,  because,  the  user  in 
this  case,  need  not  be  a proa rammer#  and  his  specification 
of  a problem  could  be  completely  dis-otqanized.  The  need 
to  think  of  a problem  as  a seouence  of  actions  and 
controls  is  eliminated,  thus  reducinq  the  time  and  effort 
required  by  the  user  (or  a programmer)  to  solve  a problem. 

The  lanquaqe  used  for  the  Durpose  is  called  MODEL,  and 
it  resembles  mote  a mathematical  language  of  functional 
and  relational  exoressions.  A problem  description  in 
MODEL  almost  follows  the  rule  of  Donald  Michie  (MIC68) , 
that  is: 

"If  a passage  of  text  is  respectable 
mathematics,  in  generally  accepted 
notation,  then  it  will  compile." 

It  is  the  function  of  the  MODEL  processor  to  generate  the 

required  algorithm  (or  procedure)  from  the  non-procedural 

soecif ication  of  a problem  in  MODEL. 

O'Donnel  (0D077)  points  out  that: 

"Descriptive  languages  are  not  as 
popular  for  practical  computation  as 
procedural  languages,  partly  because 
of  the  efficient  implementation  of 

such  languages  may  be  very  difficult. 

•• 

• • • 

However,  the  MODEL  Processor  described  in  this 
dissertation  shows  how  an  efficient  non-procedural 
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language  processor  can  be  developed 


1.2  Summary  of  Research. 

The  ultimate  objective  of  the  research  reported  in  this 
dissertation  is  the  automation  of  the  software  development 
process.  A broad  overview  of  such  an  automated  system  is 
shown  in  Figure  1.1.  It  illustrates  the  differnt 
interactions  between  the  Automatic  Program  Generater 
(MODEL  Processor)  and  the  user.  The  different  parts  of 
the  MODEL  processor,  shown  in  the  Figure,  will  be 
described  later  in  Chapter  3.  For  the  purpose  of 
identifying  the  scope  of  this  dissertation,  another 
overview  of  the  MODEL  Processor  is  shown  in  Figure  1.2. 
In  this  figure,  the  Processor  is  divided  into  two  parts. 
This  dissertion  describes  the  design  and  implementation  of 
the  first  part  (namely.  Analysis  of  MODEL  specification) 
and  the  different  user  interfaces  (see  also,  the  Figure 
1.1).  The  second  part  of  the  MODEL  processor  is  not  yet 
implemented;  however,  the  techniques  used  in  the  earlier 


version  of  the  MODEL  Processor  ( PRY76A)  can  be  used  in  the 
design  of  this  part  of  the  system. 
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figure  f.l.  Overview  of  MOnfL  Processor. 
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1.3  Summat y of  Conti ibutions . 

The  main  conti ibutions  of  the  teseaich  reoorted  in  this 
paper  can  be  categorized  into  three  parts: 

(1) .  Design  and  development  of  algorithms  required 

for  the  analysis  of  MODEL  specification, 

(2) .  Development  of  a non-procedural  language, 

MODEL, 

and  (3).  Development  of  a simple  man-machine  interface. 

In  the  following,  the  above  contributions  are  described 
in  more  detail. 

1.3.1  Algor ithms  to  Analyze  MODEL  Specification. 

The  main  contribution  of  this  dissertation  is  in  the 
desiqn  of  the  algorithms  that  perform  the  analysis  of 
MODEL  statements.  The  functions  performed  by  some  of  the 
algorithms  ate: 

(1) .  Represent  the  MODEL  soecif ication  in  a directed 

graph. 

(2) .  Analyze  the  graph  and  check  for  completeness  and 

consistency. 

(3) .  Analyze  every  iterative  computation,  and  check  for 

consistency  in  the  scope  of  iterations  and  in  the 
specification  of  subscriots. 

(4) .  Analyze  every  recursive  relation  and  check  for 
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proper  initiation  and  termination  of  the  recursion. 

(5) .  Resolve  the  errors  detected  durinq  the  execution  of 

steps  (2),  (3)  and  (4)  above,  either  by  generating 
additional  statements,  or  by  updating  the  user 
supplied  statements. 

(6) .  Report  the  errors  encountered  during  the  analysis 

and  the  ways  those  errors  were  resolved. 

(7) .  Generate  user  documentation,  by  producing  various 

listings,  cross  reference  report,  and  so  on. 

The  novel  feature  of  the  MODEL  Processor  is  the 
representation  of  the  input  specification  in  a compact 
form  of  directed  graph,  called  array  graph,  and  then  apply 
the  conventional  graph  algorithms  directly  to  it.  Such 
representation  has  been  found  quite  useful  in  analyzing 
the  completeness  and  consistency  of  iterative  and 
recursive  relations.  Many  other  features  of  the 
specification  can  be  exploited  (parallelism,  as  an 
example)  from  this  simple  representation. 

Some  quite  useful  results  on  the  analysis  of  array 
graphs  have  been  derived  in  Chapter  9.  These  include  (1) 
a necessary  and  sufficient  condition  for  the 
"sequenceability"  of  array  graphs,  (2)  algorithms  for 
splitting  (or  decomposing)  array  graphs,  and  (3)  a 
condition  for  the  *schedulability”  of  array  graphs. 
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1.3.2  The  Non-procedural  Language , MODEL. 

Unlike  conventional  languages,  MODEL  is  a 
non-procedural  or  a descriptive  language.  Some  of  the 
important  characteristics  of  the  language  are: 

(1) .  Non-procedural, 

(2) .  Tolerant, 

(3)  . Simpler , and 

(4) .  More  powerful. 

The  non-procedutalness  of  the  language  is  quite 
important.  It  makes  the  task  of  the  user  very  easy  in 
describing  a problem.  He  does  not  have  to  think  of  a 
Droblem  in  terms  of  processes.  He  regards  the  computer  as 
a black  box  and  his  description  of  a problem  is  in  terms 
of  data  or  documents  used  as  input  or  output,  as  well  as 
formulae  that  describe  the  relationshios  between  the  data. 

MODEL  is  a formal  language,  and  uses  some  reserved 
words  and  punctuation  common  to  programming  languages. 
However,  MODEL  Processor  is  quite  tolerant  of  many 
omissions  and  errors  in  the  input  statements  by  either 
generating  new  statements  or  by  updating  the  user  supplied 
statements.  The  MODEL  Processor  automatically  resolves 
most  of  the  errors  in  the  user  soecif ication. 

MODEL  is  simple  to  use.  It  consists  essentially  of  two 
types  of  statements:  data  description  statements  and 
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assertions.  Data  description  statements  define  the  data 
items  used  in  the  specification,  and  assertions  define  the 
relationships  between  different  data  items,  MODEL  does 
not  have  control  statements  like,  DO,  GOTO,  etc.,  nor  does 
it  have  the  I/O  operations  like:  READ,  WRITE,  and  so  on. 
All  the  required  control  statements  and  I/O  operations  are 
generated  by  the  MODEL  Processor.  Most  important  of  all, 
because  of  the  non-procedural  nature  of  the  language,  the 
user  does  not  have  to  soecify  any  sequence  of  operations. 

Even  though  MODEL  has  a simple  structure,  it  is  quite 
powerful.  In  fact,  in  some  cases,  it  is  more  powerful 
than  the  conventional  languages.  User  can  specify  array 
operations  as  in  higher  level  lanquaqes  like  APL.  He  can 
specify  iterative  and  recursive  relations.  He  can  also 
specify  complex  data  structures,  similar  to,  but  more 
powerful  than  the  "REFER"  option  of  PL/1. 

■ Some  of  the  characteristics  of  MODEL  are  common  with 
the  characteristics  of  the  languages  used  in  the  earlier 
versions  of  MODEL  Processor  (see  RIN76  and  PRY76A) . The 
main  differences  between  the  different  versions  are 
described  in  Chapter  2. 

1.3.3  Man-Machine  Interface . 

The  important  requirement  in  designing  a non-procedural 
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language  processor  is  the  need  of  a simole  interface,  that 
analyzes  the  mathematical  semantics  of  the  language, 
therefore,  if  a relation  between  two  data  items  is 
specified  usinq  an  assertion,  it  may  be  possible  to  obtain 
a procedure  (or  a sequence  of  computations  --  programming 
semantics)  required  to  evaluate  the  item  beinq  defined. 
If  a user  has  not  supplied  enough  information,  then  there 
may  be  a way  to  generate  that  information.  Any  error  in 
the  input  specification  must  be  resolved  and  the  user  must 
be  advised  of  such  automatic  resolution.  Every  input 
statement  must  be  checked  to  see  if  it  fits  properly  in 
the  global  environment,  and  there  must  be  a way  to 
sequence  the  different  statements  so  as  to  obtain  a 
procedural  description  of  the  problem.  The  MODEL 
Processor  performs  the  above  functions  and  can  be  regarded 
as  a useful  tool  even  for  non-programmei s 'to  develop  quite 
sophisticated  programs.  As  can  be  seen  fiom  the  flowchart 
of  the  MODEL  Processor  in  Figure  1.1,  there  is  a lot  of 
interaction  between  the  user  and  the  processor  thru 
various  reports  and  documentation.  This  interaction  helps 
the  user  in  understanding  the  functions  of  the  generated 
program  and  in  correcting  his  errors.  It  also  helps  the 
processor  to  understand  the  user  specification  correctly. 

A true  interactive  man-machine  interface  should  look  as 


shown  in  Figure  1.3.  As  shown  in  the  fiqure,  the  user 
communicates  with  the  processor  in  simple  "English-like" 
sentences,  and  a preprocessor  (labelled  EMI  — English  to 
MODEL  Interface)  translates  the  input  into  MODEL 
statements.  This  preprocessor  may  also  use  a 
"domain-expert"  to  resolve  some  errors  (like 
incompleteness)  in  the  input  using  some  domain  dependent 
knowledge.  Even  though,  such  a preorocessor  has  been 
proposed  (see  PRY76B) , it  has  not  been  implemented  for  the 
MODEL  Processor. 
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1.4  Organization  of  the  Dissertation . 

Chapter  2 gives  the  motivation  for  the  research  and  a 
summary  of  other  related  work  on  Automatic  Programming. 
Chapter  3 gives  an  overview  of  the  MODEL  Processor, 
briefly  describing  each  of  its  phases. 

Chapter  4 describes  the  MODEL  language  and  can  be  used 
as  a reference  manual  for  MODEL.  As  an  example,  a simple 
Trasaction  Processing  problem  is  described  in  MODEL. 

Chapter  5 describes  Phase-1  of  the  MODEL  Processor, 
namely  the  syntax  analysis  phase. 

Chapter  6 describes  the  organization  of  the  associative 
memory,  dictionary,  and  the  precedence  matrix.  It  also 
describes  some  of  the  procedures  that  can  be  used  to 
create  and  access  the  contents  of  the  associative  memory, 
dictionary  and  the  precedence  matrix. 

Chapter  7 describes  the  organization  of  the  MODEL 
statement  database,  and  the  procedures  for  creating  and 
updating  the  database. 

Chapter  8 describes  Phase-2  of  the  MODEL  Processor,  the 
main  function  of  which  is  to  create  the  dictionary  and  the 
precedence  matrix. 

Chapter  9 describes  Phase-3  of  the  MODEL  Processor 
which  performs  the  completeness  and  consistency  analysis 
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of  the  MODEL  specification.  The  concept  of  array  graph  is 
also  introduced  in  this  chaptei.  Some  theoretical  results 
on  array  graphs  are  also  oresented  in  this  chapter. 

Chaptei  10  describes  dhase-4  of  the  MODEL  processor 
which  produces  different  user  lenorts. 

Chapter  11  gives  an  overview  of  the  last  three  phases 
of  the  MODEL  Processor  which  perform  the  sequencing, 
optimization  and  code  generation,  respectively.  Also,  it 
summarizes  the  conclusions  th3t  can  be  drawn  based  on  the 
research,  and  suggested  directions  for  further  research. 


CHAPTER  2 


BACKGROUND,  MOTIVATION,  AND  SUMMARY  OF  RELATED 
LITERATURE  AND  RESEARCH 


2.1  Introduction. 

Though  present  day  computers  ate  often  called  Automatic 
Computers,  they  are  in  fact  far  from  being  automatic.  It 
is  true  that  computers  provide  means  for  processing  large 
and  complex  programs  quickly  and  accurately.  However, 
problem  analysis  and  design,  coding  and  testing  precede 
the  use  of  the  -computer.  To  make  the  problem 
representation  easier,  numerous  computer  languages  have 
been  developed.  These  languages  are  means  of 
communication  between  the  user  and  the  computer.  The  main 
function  of  these  languages  is  to  translate  user  input 
statements  to  the  machine  instructions  of  the  computer. 
Depending  on  the  complexity  of  the  language,  each  user 
statement  may  produce  hundreds  of  machine  language 
instructions.  Thus,  these  high  level  languages  relieve 
the  user  from  the  detailed  description  of  a problem. 

Despite  the  SDeed  and  accuracy  of  computers,  the  main 
factor  in  determining  the  usability  of  a computer  is  the 
cost  of  solving  a problem.  This  cost  includes: 

(a).  Cost  of  describing  the  problem  in  a language  (this 
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problem  is  usually  handled  by  the  application 
programmer  who  writes  oroqtams  for  the  otoblem). 

(b) .  Cost  of  foi matting  the  input  requited  for  the 

execution  of  the  Program  (this  task  is  ususlly 
performed  by  the  clerical  staff  who  keypunch  or  type 
the  input  data  into  the  system) . 

(c) .  Cost  of  executinq  the  proqram  or  the  cost  of 

maintaining  the  system  (this  task  is  performed  by  the 
computer  with  some  helo  from  the  oroqtammer ) . 

Fiqute  2.1  shows  a simple  Dtoblem  solving  process.  In 
reality,  the  process  is  much  more  complicated  than  that 
shown  in  Figure  2.1.  The  computet  may  recognize  many 
errors  durinq  the  testing  and  execution  phases  of  the 
problem  solving  orocess.  Consequently  error  feedback 
loops  are  present  in  both  these  ohases.  This  is 
illustrated  in  Figure  2.2.  Output  can  be  obtained  only 
after  the  user  modifies  the  oroblem  description  to 
eliminate  the  errors.  Deoendina  on  the  user  proficiency 
in  the  lanquaqe  in  which  the  oroblem  is  represented  and  in 
the  computer  system  he  U3es,  the  above  feedback  process 
can  continue  for  many  iterations.  Thus,  the  cost  of 
programming  may  sometimes  take  a major  share  in  the  total 
cost  of  problem  solving.  This  is  because  the  user  may  not 
be  well  trained  both  in  the  field  in  which  the  problem  is 
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Figure  2.2.  A Problem  Solvinq  Process  with 
Error  Feedback  Loops. 
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to  be  solved  and  in  the  computer  language  in  which  the 
problem  is  to  be  represented. 

2.1.1  Software  Difficulties  in  the  use  of  Computers. 

As  indicated  in  the  earlier  section,  even  an 
experienced  programmer  may  spend  a lot  of  time  in 
debugging  a program  (that  is,  in  resolving  the  errors 
caused  during  the  testing  and  execution  phases).  Usually, 
most  time  is  spent  in  resolving  the  errors  in  the 
execution  phase.  This  is  because  present  day  language 
processors  are  not  intelligent  enough  to  analyze  the 
program  and  indicate  any  faults  in  the  program  logic.  The 
programmer  loses  control  over  the  program  when  it 
increases  in  size  and  complexity,  because,  he  can 
concentrate  only  on  a small  portion  of  the  program  at  a 
time  and  fails  to  find  out  the  faults  during  the  linking 
process  of  the  different  parts  of  the  program.  Though  the 
recent  structured  programming  techniques  have  eliminated 
some  of  these  problems,  the  programmer  must  be  well 
experienced  to  write  a program  with  few  errors. 
Unfortunately,  once  he  is  well  trained  in  a language,  he 
hardly  has  any  opportunity  to  write  programs.  His 
subordinates  will  then  go  thru  the  same  learning  process 
all  over  again. 
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2.1.2  Prereguis ite  Training. 

It  is  not  sufficient  just  to  have  training  in  the 
language  in  which  the  problem  is  to  be  solved.  The  user 
must  be  familiar  with  the  computer  system  he  uses. 
Usually,  a typical  aoplication  proqram  requires  a huge 
amount  of  Input/Output  activity  between  the  data  base  and 
the  computer.  The  cost  of  I/O  ooerations  in  executing  a 
program  can  take  a substantial  share  of  the  total  cost. 
Therefore,  the  programmer  must  be  familiar  with  the 
organization  of  the  computer  system,  so  that  he  can  use 
the  most  efficient  methods  to  represent  data  files.  There 
ate  some  otoqtamminq  lannuages  which  tolerate  the  user  's 
lack  of  knowledge  about  the  system  he  uses  by  providing 
default  attributes  for  the  data  files.  However,  I/O 
operations  in  such  a system  ate  quite  expensive. 

2.1.3  Proceduralness. 

When  a programmer  attempts  to  use  a computer  to  solve 
his  problem,  he  must  state  explicitly,  and  without 
ambiguity,  the  algorithm  to  be  used  in  the  solution  of  the 
problem.  This  algorithm  is  the  Dtecise  step  by  step 
method  of  the  solution.  Such  an  algorithm  must  have 
several  important  characteristics:  (1)  Finiteness  - the 
algorithm  must  end  after  a finite  number  of  steps,  (2) 
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Definitene3s  - every  step  in  the  algorithm  must  be  defined 
unambiguously,  and  (3)  Generality  - the  algorithm  must  be 
applicable  to  a general  type  of  problem  rather  than  to  a 
specific  problem. 

In  the  existing  languages  there  is  no  facility  to  check 
if  an  algorithm  has  the  above  characteristics.  Each  step 
of  an  algorithm  must  be  explicitly  specified  in  a 
procedural  language.  So,  incorrect  specification  of  an 
algorithm  causes  errors  in  testing  or  execution  phases  of 
the  problem  solving  process. 

2.1.4  General  Structure  of  a Software  Development  Process. 

From  the  discussions  of  the  previous  sections,  a 
software  development  process  can  be  described  by  the 
flowchart  outlined  in  Figure  2.3.  The  first  phase, 
"determination  of  requirements  and  the  cost  effectiveness 
evaluation",  is  preliminary  to  the  software  development 
process  and  is  concerned  with  the  qeneral  objectives  and 
pay-off  of  the  development.  The  "system  level 
specification"  is  usually  done  by  an  individual  who  is 
familiar  with  the  problem  for  which  the  software  is  to  be 
developed.  The  next  three  phases,  namely,  "partitioning 
of  the  system  and  data",  "oroqtam  generation"  and  "program 
maintenance"  are  done  by  the  application  programmer.  If 
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the  cost  of  maintaining  the  resulting  program  is  too  high 
or  the  program  performance  is  not  satisfactory,  then  some 
decisions  may  have  to  be  made  to  change  the  oriqinal 
specification.  We  ate  primarily  interested  in  automating 
Phase  (4)  of  the  software  development  process  depicted  in 
Figure  2.3. 

Many  other  overviews  of  the  software  development 
process  are  qiven  in  the  literature.  One  detailed 
overview  is  qiven  by  Ptywes  (PRY77A,  Figure  3).  In  this 
overview,  basically,  a software  development  process  is 
divided  into  a top  part  and  a bottom  part.  The  top  part 
performs  the  task  of  determining  data  orocessinq 
requirements  and  generates  a specification  in  a high  level 
language  acceptable  by  the  bottom  part. 

In  this  dissertation  I will  be  describing  the  design 
and  implementation  of  the  bottom  part.  An  interface  that 
pel  forms  the  functions  of  the  top  oart  was  proposed  by 
Ptywes  (PRY76B)  but  is  not  yet  implemented. 
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2.2  Research  on  Automating  Software  Development . 

Research  on  Automating  the  Software  Development  Process 
described  in  the  previous  section  has  been  underway  for 
many  years.  As  a result,  many  tools  like  VHLL  processors, 

Automatic  Programming  Systems  have  been  developed.  The 
Leview  papers  by  Prywes  ( PRY77A ) , Biermanr.  (BIE76)  , and 
Ruth  (RUT76)  give  a geneial  description  of  recent 
developments  in  the  broad  area  of  "Automatic  Programming". 

In  the  following,  some  of  the  works,  closely  related  to 
this  dissertation,  are  briefly  described. 

2.2.1  MODEL  System. 

The  first  version  of  the  MODEL  Processor  (MODEL-I)  was 
developed  by  Rin  (RIN76).  His  system  generated  programs 
for  "Transaction  Processing"  problems.  In  his  system,  the 
user  specification  consisted  of  the  definition  of  the  data 

structures  (data  description  statements)  and  relations  • 

between  the  data  names  (assertions)  using  the 
non-procedural  language  MODEL.  This  system  had  several 
drawbacks . 

(1).  The  language  was  not  too  easy  to  use.  The  user  had 
to  explicitly  define  every  variable  used  in  the 
specification.  The  assertions  used  the  syntax  of 
PL/I,  and  were  never  scanned.  Therefore,  the  user. 


again,  had  to  explicitly  specify  every  source  and 
target  varaible  used  in  an  assertion. 

(2) .  The  system  was  restricted  to  a certain  class  of 

problems.  It  could  handle  only  one  dtivinq  file, 
and  one  type  of  record  in  any  file. 

(3) .  Iterations  could  not  be  specified  easily. 

(4) .  Very  few  checks  were  performed  for  consistency  and 

completeness  of  specification.  The  system  did  not 
attempt  to  resolve  any  error  in  the  user 
specification. 

Most  of  the  above  problems  were  resolved  in  the  second 
version  of  the  MODEL  system  (MODEL-II)  system  developed  by 
the  Automatic  Program  Generation  Group  at  the  Moore  School 
(of  which  the  author  was  a member,  see  also,  PRY76A) . In 
MODEL-II,  the  syntax  of  the  lanquaqe  was  simplified  to  a 
great  extent,  the  allowable  data  structure  was  extended, 
and  the  user  could  specify  the  iterations  fairly  easily. 
These  extensions  eliminated  the  drawbacks  (1),  (2)  and  (3) 
of  Pin's  system  (mentioned  above).  The  above  extensions 
were  based  on  the  specification  language  described  by 
Pnueli  ( PNU76A , PNU77) . 

The  MODEL  system  described  in  this  dissertation  (will 
be  referred  to  as  MODEL-III,  or  simply  MODEL)  is  different 
from  the  earlier  versions  in  the  following  sense: 
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(1) .  The  language  is  simple,  and  the  user  is  not  reauired 

to  specify  every  detail. 

(2) .  The  language  is  mote  oowetful.  The  usei  can  soecify 

tecutsions  an^  iterations  easilv.  He  can  soecify 
aLLay  operations  without  the  use  of  subscripts  (as 
in  APL) . 

(3) ,  It  is  quite  tolerant  of  user  errors.  It  resolves 

the  errors  by  generating  new  statements  or  updatinq 
the  user  supplied  statements. 

(4) .  It  uses  a special  form  of  directed  qraph  to 

represent  the  MODEL  specification  and  analyzes  it 
for  consistency  and  completeness,  and  then, 
sequences  the  different  operations  implied  by  it. 
There  are  other  extensions  that  are  beinq  built  into 
the  MODEL  Processor.  They  are:  (1)  Adding  the  modelling 
and  simulation  techniaues  needed  in  natural  and  social 
sciences  to  the  knowledge  base  (GAN78) , (2)  Partitioning 

of  the  software  system  progressively  in  a top-down  manner 
into  sub-modules  (GIB75) , and  (3)  Exoloitation  of 
parallelism  in  MODEL  Proarams  (CHE78) . 

2.2.2  BDL  System. 

BDL  (Business  Definition  Language)  is  an  automatic 
programming  system  developed  by  IBM  (GOL75A,  GOL753, 
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HAM74 , HAM77  and  LEA77).  Programming  in  BDL  begins  by 
drawing  on  a screen  (of  an  interactive  Graphics  Display 
terminal)  labelled  boxes  representing  principal  operating 
entities  within  the  application  area  to  be  automated. 
These  boxes  are  connected  by  paths  indicating  the  lines  of 
communication  among  them.  The  semantics  associated  with 
this  part  of  BDL  is  that  the  step  from  which  the  path 
emanates  is  expected  to  produce  a set  of  documents  at  some 
instant  of  time.  Thus,  the  language  is  a data  flow 
language  with  steps  producing  all  their  output 
simultaneously  and  executing  whenever  their  input  is 
ready. 

Each  box  in  the  original  BDL  program  can  be  described 
further  using  another  BDL  proqtam.'  The  user  can  continue 
this  process  till  each  box  can  no  longer  be  sub-divided 
further.  At  this  time  he  has  to  indicate  the  details  of 
relation  between  the  group  of  input  documents  and  qroup  of 
output  documents.  This  is  achieved  by  a tabular  language. 
Part  of  this  tabular  program  for  a problem  is  shown  in 
Figure  2.4.  In  this  figure,  the  name  suffixed  by  " ’s"  is 
a repeating  data  name.  The  causality  for  the  name 
"Invoice"  is  given  as  "One  PER  LIKE  Order's".  LIKE 
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Order  's  aie  defined  to  be  "Order  *s  WITH  COMMON  Custt". 


I Group/Field  I Causality/  I Name 

I I Derivation  I 


I Definition  I 


T Inv6TceTs~T  (5ne  PER-Li~ke  r-Like~6tdetTs  ~T  Oidei  rs  WITH 
I I Order's  I I COMMON  Cust# 


I Order  's 


INPUT 


Fiqure  2.4 


The  flowchart  specif ication  and  the  tabular 
specification,  toqether  form  a BDL  specification  for  a 
oroblem.  The  flowchart  specification  of  a problem  in  BDL 
may  be  quite  easy  for  non-proqr ammer s , because,  they  do 
not  have  to  learn  the  syntax  and  semantics  of  the  formal 
specification  problem.  However,  the  tabular  lanquaqe  has 
a lot  of  syntax  built  into  it  (for  examole,  keywords  like 
WITH,  COMMON,  LIKE  and  so  on)  and  may  be  difficult  for  use 
by  non-ptoqtammers. 


2.2.3  PSI  System. 


The  PSI  system  developed  at  Stanford  (GRE76) , uses  as 
in  SDL,  a Model  based  approach.  However,  in  this  system 
provision  is  made  for  the  incorporation  of  an  independent 
domain  expert  module  that  "knows"  the  terminoloqy  of  a 
particular  domain  and  is  familiar  with  the  common 
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activities.  Thus,  for  different  problem  domains,  just  , 

this  domain  expert  has  to  be  replaced,  without  any  change 
to  the  test  of  the  system.  The  function  of  the  domain 

r 

expert  is  to  free  the  user  from  making  explicit 
descriptions  and  associations  of  things  that  are  obvious 
to  him. 

2.2.4  USC/ I SI  System. 

The  APS  (Automatic  Programming  System,  BAL72,  BAL74, 

HEI76)  developed  by  the  qroup  at  ISI  is  similar  to  the  PSI 
system.  However,  the  important  difference  is  its  emphasis 
on  "domain  independence".  This  means  that  the  system  is 
not  primed  with  information  about  the  specific  problem 
area,  but  must  obtain  all  of  this  information.  The 
user-system  dialoq  consists  of  the  user  initially  statinq 
his  problem,  from  which  the  system  constructs  a "loose 
model".  Then  the  system,  through  a process  called  "model 
completion”,  attempts  to  transform  this  loose  model  into 

■ t 

an  operational,  interpretable  form  called  "precise  model". 

The  model  completion  process  usually  requires  further 
dialogue  with  the  user. 

2.2.5  SURGE  System. 


SURGE  is  a COBOL  Dr e-Drocessor  (PET76)  for  the  machine 
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generation  of  source  nroqrams.  The  caoabilities  of  SURGE 

i 

aie  described  as  file  sorting,  selective  retrieval  and 
tabulat  report  preparation,  including  multiple  levels  of 
totals  and  caoability  for  certain  other  kinds  of 
computation.  It  has  however,  a riqid  syntax,  and  is  not 
applicable  to  problem  areas  other  than  the  ones  mentioned 
above. 

2.2.6  PROTOSYSTEM- I . 

_l  1 T r r‘  " ' • 

PROTOSYSTEM-I  is  developed  by  the  Automatic  Program 
Generation  Grouo  at  MIT  (RUT76).  It  consists  of  top  and 
bottom  parts.  The  too  part  consists  of  the  man-machine 
interface  and  a knowledge  base  on  business  management  of 
inventory,  distribution,  or  procurement.  It  is  intended 
to  play  the  role  of  a manaqement  consultant  for  users  with 

such  business  operation  problems.  The  research  on  Program  f 

Writer,  by  Long  (LON77),  on  questionaire  aoptoaches 

employed  in  consultant's  role  by  Bosyi  (BOS76)  and 

Malhotra  (MAL75)  and  on  the  OWL  lanquaqe  used  for  the  user- 

communication  by  Martin  (MAR74)  are  some  of  the  works  on 

the  top  part  of  the  PROTOSYSTEM-I.  The  bottom  part  of 

PROTOSYSTEM-I  was  implemented  by  Ruth  ( RUT75 , RUT76A) 

which  obtains  a data  processing  specification  from  the  too 

part,  performs  svstem  design,  and  oenerates  PL/1  code. 
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The  system  specification  lanquaqe  (SSL)  used  in  the  bottom 
part  of  PROTOSYSTEM- I somewhat  resembles  MODEL.  For 
example,  a relation  in  SSL  looks  like: 

PAY  IS  HOURS_WORKED  * RATE 

Repetition  inherent  in  the  data  items  PAY,  HOURS_WORKED 
are  implicit  here,  just  as  in  MODEL. 

2.2.7  System  for  Business  Automation. 

The  System  for  Business  Automation  (SBA)  is  developied 
at  IBM  (ZL077b,  PET77)  and  is  a hiqh  level  data  base 
management  lanquaqe.  A subset  of  SBA,  called 
Query-BY-Example  is  also  described  in  many  reports 
(ZL077A,  ZL075A,  ZL076  and  ZL075B) . These  languages 
provide  provisions  to  query,  update,  define,  and  control  a 
relational  data  base.  In  these  systems,  when  the  user 
performs  an  operation  against  the  data  base,  he  fills  in 
an  example  of  a solution  to  that  operation  in  skeleton 
tables  that  can  be  associated  with  actual  tables  in  the 
data  base.  As  opposed  to  English-like  query  languages  by 
which  the  user  has  to  conform  to  the  phrase  structure  of 
the  lanquaqe,  the  Ouery-by-example  may  enter  any 
expression  as  an  entry  as  long  as  it  is  syntactically 


correct 
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2.2.8  LUCID. 

LUCID  (ASH77)  is  a formal  system  in  which  a 
non-procedural  lanquage,  LUCID,  is  used  to  write  programs, 
and  at  the  same  time  verify  them.  A LUCID  program  can  be 
thought  of  as  a collection  of  commands  describing  an 
algorithm  in  terms  of  assignments  and  loops,  but  at  the 
same  time,  LUCID  is  strictly  a denotational  lanquage,  and 
the  statements  of  a LUCID  program  can  be  interpreted  as 
true  mathematical  assertions  about  the  results  and  effects 
of  the  program.  As  in  PROTOSYSTEM-I , LUCID  has  the  single 
assignment  property,  and  each  assignment  in  LUCID  is 
equivalent  to  an  equation.  Even  though,  LUCID  does  not 
employ  the  conceot  of  arrays  (at  least  in  the  initial 
implementation),  the  "history"  of  a variable  can  be  used 
to  interpret  an  array.  The  operators  like,  "first",  and 
"next",  are  used  to  define  iteration,  along  with  the 
operators  like  "as  soon  as"  (like  the  keyword  ENOGRP  in 
MODEL)  are  used  to  terminate  iterations. 

Even  thouqh  LUCID  is  claimed  to  be  a non-procedural 
lanquage,  programs  in  LUCID  do  not  look  different  from  the 
iterative  proqtams  in  conventional  languages.  The  user 
has  to  think  of  loops  and  loop  control  variables  when 
trying  to  write  a LUCID  program.  For  examole,  a LUCID 
program  to  compute  a factorial  can  be  written  as  follows: 
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N * first  input; 

first  F * 1; 

first  1*1; 

next  F * F * I; 

next  1*1+1; 

output  * F as  soon  as  I >*  N; 

which  is  no  better  than  a procedural  program  in  a 
conventional  language. 

2.2.9  NOPAL  System. 

The  NOPAL  system  is  developed  at  the  Moore  School,  for- 
designing  functional  and  fault  isolation  tests  of  analog 
circuit  equipment  and  for  generating  corresponding 
programs  for  computer  controlled  automatic  test  equipment 
( PRY75A) . NOPAL  system  consists  of  two  parts:  the  top 
part  and  the  bottom  part.  The  top  part  was  developed  by 
Tinaztepe  (TIN77)  and  determines  efficient  tests  and  their 
respective  diagnoses.  The  bottom  part  was  developed  by 
Chang  (CHA  77)  and  accepts  as  input,  the  tests  specified 
in  the  NOPAL  language  (CHE76)  and  produces  as  output,  an 
efficient  program  in  the  OPAL  high  level  test  equipment 
programming  language. 

2.2.10  IS DOS. 

The  ISDOS  (Information  System  Desiqn  and  Optimization 
System)  project  at  the  University  of  Michigan  has 
developed  a documentation  aid  for  composing  and  reporting 
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system  specifications  (TEI72).  It  uses  a Problem 

t 

Statement  Lanquaqe  (PSL)  to  expiess  system  functional 
SDecif ication  formally.  This  system  is  a useful  tool  for 
a problem  definer  to  express  a problem  without  havinq  to 
specify  how  the  task  will  be  carried  out.  This 
specification  is  analyzed  by  another  system  called  Problem 
Statement  Analyzer  (PSA)  which  (1)  performs  consistency 

and  completeness  analysis  and  (2)  performs  dynamic  ' 

analysis  to  indicate  time  dependent  relationships  of  data. 

The  result  of  these  analyses  is  an  error  free  problem  ‘ 

statement  in  machine  readable  format.  The  second  output 

* 

is  a coded  statement  for  use  in  physical  system  desiqn 
process. 


CHAPTER  3 


OVERVIEW  OF  THE  MODEL  PROCESSOR 

In  this  chapter,  the  functions  of  the  MODEL  Processor 
are  briefly  described.  The  MODEL  Processor  can  be  divided 
into  the  following  7 phases: 

1.  Syntax  Analysis. 

2.  Determination  of  Precedence  Relationships. 

3.  Analysis  of  the  Directed  Graph. 

f 

4.  Documentation. 

5.  Seouencing. 

6.  Cod?  Generation. 

7.  Compilation  and  execution  of  the  generated 
Program. 

Figure  3.1  gives  an  overview  of  the  MODEL  Processor.  In 
the  following  sections,  the  different  phases  of  the  MODEL 
processor  are  briefly  described. 

3.1  Syntax  Analysis. 

In  this  phase  of  the  MODEL  processor,  the  MODEL 
statements  are  solicited  from  the  user,  one  at  a 
time,  and  each  statement  is  analyzed  for  proper- 
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Flr.urc  3.1.  Overview  of  MOnn,  Processor. 
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Alternately,  the  user  can  use  a conventional  text  editor 
to  create  a file  of  MODEL  statements  and  save  it  in  a 
secondary  storaqe.  The  syntax  analyzer  will  then  obtain 
statements  from  the  file,  one  at  a time,  and  aqain,  each 
statement  is  analyzed  for  proper  syntax.  Any  error  found 
durinq  the  syntax  analysis  is  reported  to  the  user.  A 
novice  can  also  interact  with  the  processor  to  obtain  the 
proper  syntax  of  a particular  statement,  or  to  obtain  an 
explanation  of  the  error  reported  by  the  system.  This 
phase  is  furter  described  in  Chapter  5. 

After  successfully  analyzinq  a MODEL  statement  for 
proper  syntax,  it  is  saved  in  the  associative  memory. 
Associative  Memory,  as  shown  in  fiqure  3.1,  consists  of 
the  following  4 parts: 


i 


1.  Directory 

2.  Storage  entry  area 

3.  Dictionary 

and  4.  Precedence  Matrix. 

Directory  consists  of  one  entry  for  each  name  used  by 
the  user  and  one  entry  for  each  reserved  key  name  used  in 
the  system.  Storaqe  entry  area  consists  of  a storaqe 
entry  for  each  MODEL  statement.  Only  the  directory  and 
the  storaqe  entries  are  created  durinq  the  syntax 


l 


analysis.  The  dictionary  and  the  Precedence  Matrix  ate 
created  during  the  second  chase  of  the  MODEL  otocessor 
(see  also,  section  3.2).  However,  the  directory  entries 
and  the  storaqe  entries  may  be  updated  even  during  the 
second  and  third  phases  of  the  MODEL  Processor,  either  by 
chanqinq  a user  suoplied  statement  or  by  generatinq 
additional  statements  to  resolve  some  user  errors. 
Organization  of  the  Associative  Memory  is  described  in 
Chapter  6. 

3.2  Determination  of  Precedence  Relationships . 

Durinq  this  phase,  the  statements  stored  in  the 
associative  memory  are  analyzed.  As  a result,  it 
qenerates  two  additional  carts  of  the  associative  memory, 
namely,  the  dictionary  and  the  precedence  matrix. 

The  dictionary  is  similar  to  the  directory  of  the 
associative  memory,  but  contains  more  information  about 
each  data  name  used  in  the  soecif ication.  Also,  for  a 
data  name  there  can  exist  only  one  directory  entry,  but 
the  same  name  can  be  represented  by  more  than  one 
dictionary  entries.  These  different  dictionary  entries 
identify  the  different  representations  of  the  same  name, 
which  have  different  parents  or  different  subscripts. 

The  precednece  matrix  contains  precedence  relationships 


between  the  different  dictionary  entries.  These 
precedence  relationsh iDS  are  determined  by  analyzinq  the 
description  of  each  statement.  This  precedence  matrix  is 
also  called  as  weiqhted  adjacency  matrix  which  represents 
a directed  graph  defining  the  precedence  relationships 
(edges  of  the  qraph)  between  the  different  dictionary 
entries  (nodes  of  the  graoh). 

The  directed  graph  represented  by  the  precedence  matrix 
is  farther  analyzed  during  the  third  phase  of  the  MODEL 
processor  (see  also  the  section  3.3). 

During  the  generation  of  the  dictionary  and  the 
precedence  matrix,  some  forms  of  incompleteness  are 
resolved  by  generating  additional  data  description 
statements.  This  partial  resolution  of  incompleteness 
simplifies  the  analysis  of  the  qraph  later  on.  This  phase 
is  further  described  in  Chapter  8. 

3.3  Analysis  of  the  Directed  Graph. 


w 


In  this  phase,  the  directed  graph  is  analyzed  for 
inconsistency,  incompleteness  and  cycles.  Inconsistency 
can  be  due  to  invalid  subscript  range  specification,  or 
due  to  inconsistent  use  of  subscript  names. 
Incompleteness  can  be  due  to  the  omission  of  the  data 
description  statements  for  some  data  names  or  the  omission 
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of  an  assertion  that  evaluates  a field  of  an  outout  file. 
Most  of  these  incompletenesses  and  inconsistencies  are 
automatically  resolved  and  are  repotted  to  the  user  for 
his  verification.  Unresolvable  errors  are  also  reported 
to  the  user,  and  the  user  response  is  appropriately 
processed. 

The  directed  qtaoh  should  not  contain  any  cycles.  If 
it  does,  it  cannot  be  sequenced.  Therefore,  all  the 
assertions  that  form  a cycle  are  further  analyzed;  and  if 
they  represent  a set  of  simultaneous  equations,  then  they 
ate  "tr ianqulat ized"  usinn  standard  solution  methods.  If 
block  trianqularization  is  not  possible,  then  the  error  is 
reported  to  the  user.  This  phase  is  further  described  in 
Chaoter  9. 

3.4  Documentation. 


This  phase  qenerates  some  reports  from  the  analysis  of 
the  MODEL  specification.  One  of  the  qenerated  reports  is 
the  cross  reference  report  which  provides  an  alphabetical 
listinq  of  all  the  names  provided  by  the  user.  For  each 
name,  the  report  provides  the  statement  number  in  which 
the  name  was  described,  the  statement  numbers  of 
statements  in  which  it  was  referenced,  and  the  attributes 
of  the  name. 
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The  second  reoort  is  the  foimatted  MODEL  specification, 
which  is  a complete  MODEL  specification  of  the  oroblem. 
This  foimatted  specification  is  also  saved  in  the  MODEL 
statement  library,  so  that  any  section  of  it  can  be 
referenced  by  another  user,  by  appropriately  identifying 
the  module  name  and  the  section  name.  This  phase  is 
further  described  in  Chapter  10. 

3.5  Sequencing. 

In  this  phase,  the  directed  graph  is  sequenced  using 
standard  topological  sorting  techniques.  As  a result,  a 
flow  chart-like  report  is  generated,  which  describes  the 
sequence  and  control  logic  of  the  desired  module.  This 
phase  of  the  MODEL  processor  has  not  been  implemented; 
however,  the  techniques  used  in  the  earlier  version  of  the 
system  ( PRY76A)  can  be  used  in  the  design  of  this  phase 
(see  also  the  Chapter  11). 

3.6  Code  Generation. 

In  this  phase,  the  proqtam  for  the  module  is  generated 
in  a high  level  language,  PL/1.  It  uses  the  flowchart 
output  of  the  sequencino  phase  and  inserts  appropriate  I/O 
commands  and  control  statements,  thus  obtaining  an 
executable  PL/1  program.  This  phase  of  the  MODEL 
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processor  has  not  been  implemented ; howevei  , the 
techniques  used  in  the  earlier  version  of  the  system 
( PRY76A)  can  be  used  in  the  design  of  this  phase  (see  also 
the  Chapter  11). 

3.7  Compilation  and  Execution  of  the  Generated  Program. 

The  proqram  generated  in  the  previous  ohase  is  now 
compiled  using  the  PL/1  Optimizing  Compiler,  thus 
generating  an  object  module  or  load  module.  This  object 
or  load  module  can  then  be  executed  by  traditional  means. 
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CHAPTER  4 


USER  REFERENCE  MANUAL  FOR  MODEL 

This  chapter  serves  as  a reference  manual  for  using  the 
MODEL  language.  It  describes  in  detail,  the  various 
statements  in  MODEL.  For  each  statement,  its  purpose, 
syntax,  and  semantics  are  given,  along  with  some  examples. 

As  an  illustrative  example,  a transaction  processing 
problem  is  described  in  MODEL  in  section  4.8. 

4.1  General  Information . 

A specification  of  a module  in  MODEL  consists  of  a set 
of  statements  which  describe  the  desired  module.  These 
statements  can  occur  in  any  order.  However,  the  different 
types  of  statements  can  be  divided  into  the  following 
three  sections. 

(1) .  Header  section:  the  statements  in  this 
section  describe  the  source  and  target  files  used 
in  the  module,  and  any  reference  to  other  sections 
of  the  MODEL  statement  database. 

(2) .  Data  Description  section:  the  statements  in 
this  section  describe  the  media,  files,  records, 
groups,  fields,  interim  and  subscript  names. 
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(3).  Assertions  sections  The  statements  in  this 
section  describe  the  relationship  between  the 
different  data  names  used  in  the  soecif ications. 

The  organization  of  different  MODEL  statements  is  shown 
in  the  form  of  a tree  structure  in  Fiqure  4.1. 

4.1.1  Syntax  Notation. 

In  the  description  of  the  syntax  of  MODEL  statements 
below,  upper  case  letters  refer  to  specific  MODEL 
vocabulary  or  to  user  orovided  names  and  angle-bracketed 
< > lower  case  letters  refer  to  a generic  class  for  which 
a specific  item  needs  to  be  substituted.  The  symbol 
means  "is  defined  as”.  Square  brackets  ( 1 indicate 
optional  portions  of  the  statements.  Asterisks  following 
square  brackets  [ ]*  siqnify  repetition  zero  or  more 
times.  The  "I"  symbol  means  "or",  and  is  used  for 
alternatives. 

The  following  convention  will  be  used  in  the 
description  of  syntax  in  the  following  sub-sections:  If  a 
syntax  statement  exceeds  a line,  then,  it  is  split  into 
two  or  more  lines,  the  second  and  subsequent  lines  being 
indented  by  at  least  4 blanks.  Also,  if  a keyword  in  a 
statement  has  many  representations,  and  some  of  those 
representations  are  equivalent,  then  only  those 
representations  that  are  equivalent  are  indicated  on  the 
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sane  line,  the  others  ate  indicated  on  subsequent  lines, 
properly  indented  (for  example,  see  the  syntax  of 
<oro-type>  in  section  4.4).  However,  if  none  of  the 
representations  are  equivalent,  then  all  the 
representations  nay  aopear  in  the  sane  line  (for  example, 
see  the  syntax  of  <diso-type>  in  section  4.4). 

The  formal  syntax  of  MODEL  is  provided  in  section  4.7 
in  Extended  Backus  Normal  Form  (EBNF)  specification 
language  as  a supplement. 

4.1.2  Format  of  MODEL  Statements . 

MODEL  statements  must  be  in  one  of  the  following 
formats : 

(1) .  Fixed  (or  Fixed-blocked)  with  record  size  * 

BO. 

(2) .  Variable  (or  variable-blocked)  with  maximum 
record  size  * 90. 

Any  MODEL  statement  can  span  the  first  72  characters  of 
each  record.  Columns  73  thru  90  specify  the  line  number- 
associated  with  the  statement.  The  line  numbers  in 
consecutive  records  need  not  be  in  sequence.  If  no  line 
number  is  specified  (that  is,  if  columns  73  thru  90 
contain  non-numeric  data),  then  the  MODEL  processor 
assigns  a line  number  to  each  statement,  which  is  the  same 


f 

| 

I 

• l 

t 

t 


» 


■i 

i 


- 47  - 


as  the  record  number.  A record  can  contain  any  number  of 
statements,  or  a statement  can  span  more  than  one  record. 
In  the  latter  case,  those  records  that  contain  the  same 
statement  must  be  in  sequence. 

Note  that  any  data  in  a record  beyond  column  73  is 
ignored. 

4.1.3  Names . 

Whenever  a <name>  is  indicated  in  the  statements  of 
MODEL,  it  may  consist  of  1 to  31  characters  described 
below.  Strictly  speakinq,  only  the  names  ABSENT,  ENDGRP, 
ENDFILE,  EXIST,  LENGTH,  POINTER  and  SUBSET  ate  the 
reserved  words  in  MODEL.  However,  to  avoid  ambiquity,  the 
following  key  words  also  may  not  be  used  for  names: 
MODULE,  SOURCE,  TARGET,  END,  SECTION,  REFER,  HELP,  IF. 
The  special  reserved  words  mentioned  above  are  described 
in  section  4.7. 

4.1.4  Character  Set . 

The  characters  which  can  be  used  to  form  names  can  be 
any  combination  of  1 to  31  letters,  digits,  or  special 
characters  defined  below,  but  the  first  character  of  a 
name  must  be  a letter. 

<letters>: :■  A I B | . . . I Y I Z I « | $ | 9 
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<diqits>  ::=  0 I 1 | ...  | 9 
<specia 1 -character > : : * 


4.1.6  Integers . 

Whenever  an  <integer>  is  indicated  it  may  be  any 
combination  of  diqits  with  the  value  from  0 to  32767, 
except  where  further  limits  are  indicated. 

4.1.6  Indication  of  End  of  Statement . 

The  character  can  be  used  ootionally  as  a 
delimiter  to  indicate  end  of  a statement.  As  this 
delimiter  is  optional,  it  will  not  be  shown  in  the  syntax 
of  statements  that  follow. 

There  is  one  risk  of  not  usinq  at  the  end  of  each 
statement:  that  is,  if  there  is  any  syntactical  error  in  a 
statement,  the  tecoid  containinq  the  statement  is 
discarded.  Therefore,  if  two  statements  were  specified  in 
a record,  and  a syntactical  error  was  found  in  the  first 
statement  of  that  record,  then  the  second  statement  is 
lost  if  no  was  used  to  separate  the  two  statements  in 
that  record.  Therefore  it  is  recommended  to  use  *>'  at 
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the  end  of  each  statement,  or  specify  only  one  MODEL 
statement  on  each  record (or  line).  This  is  further 
described  in  chapter  10. 
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4.1.7  Qualified  Names . 

Whenever  a <qual i f ied-name>  is  indicated  in  a 
statement,  it  may  consist  of  1 to  10  <name>s  connected 
together  as  shown  in  the  following  syntax: 

<qual i f ied-name> : : = <name>  [.<name>l* 

4.1.8  Some  Common  Terms. 

In  the  followinq,  the  syntax  of  some  commonly  used 
terms  are  qiven: 

<is>::«  IS  | ARE  | = 

<qualif ied-names>: :*  <aualif ied-name>  f , <qual i f ied-name> ] * 
<list-of-names>: :«  <name>  l,<name>)* 

<pname>::»  (<name>.1* 

<dd-name>::*  <pname>  <name> 

<list-of-dd-names>: :■  <dd-name>  l , <dd-name>l * 

4.1.9  Error  Handling. 

If  a syntactical  error  is  detected  in  a MODEL 
statement,  then  that  error  is  reported  to  the  user  bys 

(1) .  Displaying  the  input  line  which  contains  the  MODEL 
statement  in  erior, 

(2) .  Indicating  the  position  of  error  by  a • | • sign,  and 

(3) .  Displaying  the  error  code. 


- 50  - 


I 

v 

t 


Foi  examole,  in  the  following  statement: 

f 

A ^ ( 1 ) + C (X)  ; 

a *='  siqn  is  expected  between  the  letters  A and  B.  The 
Processor  will  display  the  error  as  shown  below: 

A 0(1)  + C(X) ; 

I 

WEQUAL  ERROR.  INVALID  TEXT  BEGINNING  *P  ' IN  LINE  XXX. 

STATEMENT  TILL  THE  NEXT  SEMICOLON  IS  DISCARDED. 

The  enrot  code  in  this  case  is  WEQUAL,  and  if  the 
description  of  the  error  code  is  required,  then  the  user 
might  type: 

HELP  WEQUAL 

(see  also  the  description  of  the  HELP  statement  in  the 
next  chapter). 

Also,  whenever  an  error  is  detected,  the  input 
statement  till  the  next  semicolon  is  discarded.  However, 
if  no  semicolon  is  found  in  the  line,  then  the  whole  line 
is  discarded. 

* i 

4.2  A Mini-manual  for  MODEL.  . 

This  section  gives  a simplified  description  of  the 
MODEL  statements  usinq  some  examples.  The  complete  syntax 
and  the  detailed  description  of  each  statement  is  given  in 
the  next  three  sections. 


9 

Essentially,  there  ate  two  types  of  statements  in 


MODEL: 


(1) .  Data  description  statements  which  describe  the 

structure  of  a file  and  its  sub-components,  and 

(2) .  Assertions  that  define  the  relationship  between 

different  data  items. 

4.2.1  Data  Description  Statements . 

The  data  description  statements  can  be  described  by  the 
following  simplified  syntax: 

<ddstmt>::»  <name>  IS  <stmt-type>  [ ( <arquments> ) ] 
<stmt-type>: :«  PILE  | RECORD  | GROUP  I FIELD 
| INTERIM  | SUBSCRIPT 

The  above  statements  will  be  described  usinq  the  example 
in  Figure  4.2.  The  statements  in  this  figure  describe  the 
structure  of  a file  called  INPUT.  It  consists  of  a set  of 
records  (number  of  records  in  the  set  is  indicated  by  the 
repetition  which  means  that  it  is  unknown  and  is  to 

be  determined  from  the  end-of-file  condition)  denoted  by 
INREC  (statement  2).  Each  record  consists  of  a field 
"NO_OF_ITEMS" , and  a group  of  two  fields  denoted  by  ITEMG. 
The  number  of  ITEMG  in  each  record  INREC  is  determined  by 
the  field  MO_OF_ITEMS.  ITEMG  consists  of  two  fields 
denoted  by  ITEM#  and  OUAnt  (statements  5 and  6).  Both  of 
these  items  are  3 characters  long.  Therefore,  the  length 
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of  each  ITEMG  is  6,  and  the  lenqth  of  each  record  is  qiven 
by  the  expression: 

2 + NO_OF_ ITEMS  * 6 

Fiqure  4.3  gives  the  physical  organization  of  the  IMPUT 
file,  in  which  the  first  two  records  are  shown.  In  this 
figure,  the  first  record  is  shown  to  have  two  ITEMG  and 
the  second,  only  one.  The  offset  of  individual  item  (from 
the  beginning  of  the  corresponding  record)  is  shown  at  the 
top  of  the  fiqure. 

In  a data  description  statement,  the  first  argument 
(for  example,  "INPUT”  in  statement  2 of  Fiqure  4.2) 
specifies  the  parent  name  of  that  data  item.  This  linking 
between  the  data  name  and  its  patent  allows  us  to 
interpret  every  file  descriotion  in  the  form  of  a tree 
structure.  For  examnle,  the  tree  structure  for  the 
statements  in  Figure  4.2  is  shown  in  Figure  4.4.  The 
other  arguments  of  a id  statement  describe  the  repetition 
of  that  item  (as  in  statements  2 and  4 of  Fiqure  4.2)  or 
the  length  and  attributes  of  that  item  (as  in  statements 
3,  5 and  6 of  Fiqure  4.2). 
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INPUT  IS  FILE 


INREC  IS  RECORD (INPUT, (*)) J 

N0_0F_ITEMS  IS  PIELD(INREC,PIC *99 ’) 
ITEMG  IS  GROUP(INREC, (NO_OF_ITEMS) ) 
ITEM#  IS  F I ELD (ITEMS, PIC '999 ') ; 
COUNT  IS  FIELD (ITEMS, PIC '999 ') ; 


Figure  4.2,  A Sample  File  Description 


ITEM#  QUANT  ITEM#  QUANT 


ITEM#  QUANT 


ITEMG(l)  ITEMG ( 2)  ITEMG(l) 


INREC(l)  INREC (2) 


Figure  4.3.  Layout  of  the  File  Described  in 
Figure  4.2. 
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INPUT 


Figure  4 


.4.  Tree  Structure  of  the  INPUT  file 
Described  in  Figure  4.2. 
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The  first  argument  (that  is,  Darent  name)  can  be 
omitted  in  some  instances.  In  this  case,  the  oreceedinq 
data  item  which  is  a level  higher  in  the  tree  structure  is 
assumed  to  be  the  parent.  Fot  example,  the  statement  5 in 
Fiqure  4.2  can  be  replaced  by: 

ITEM#  IS  FIELD  (PIC  *999  * ) ; 
without  any  change  in  the  semantics. 

All  the  fields  in  the  INPUT  file  in  Fiaure  4.2  were 
described  as  numeric  items  (diqits  0-9)  using  the 
attribute  PIC'9'.  However,  an  input  field  may  contain 
non-numeric  items,  in  which  case,  different  attributes  has 
to  be  used  (see  also  section  4.4).  Foi  examole,  the 
statement : 

NAME  13  FIELD (CHAR ( 20) ) 

describes  a field  which  is  a string  of  length  20. 

The  above  examples  give  a simplified  description  of 
some  of  the  data  description  statements  in  MODEL.  The 
detailed  descriotion  of  each  data  descriDtion  statement  is 
given  in  section  4.4. 
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4.2.2  Assertions. 

A simplified  syntax  for  the  assertions  is  given  below: 

<asser tions> : : * <for-clause>  <assertions> 

I <boolean-clause>  <assertions> 

I <simple-assertion> 

<simple-assertion> : : = <data-name>  = <expression> 

In  the  above  syntax,  the  <for-clause>  specifies  a set  of 
ranges  for  the  subscript  names  used  in  the  assertion;  and 
the  <boolean-clause>  is  a boolean  expression  used  to 
conditionally  execute  the  following  assertion.  The 
<simple-asser tion>  defines  the  relationship  between  the 
data-item  on  the  LHS  of  the  equal  sign  and  the  data  items 
on  the  RHS.  RHS  can  be  any  expression  using  regular- 
operators  like,  +,  -.  *,  /,  >=...,  etc.  The  data  items 

on  the  LHS  or  RHS  can  be  optionally  subscripted. 
Functions  can  also  be  used  in  the  expressions  on  the  RHS. 
Some  examples  of  assertions  are  shown  in  Fiqures  4.5  and 
4.6. 
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ITEM_SUM  = OUANT  * PRICE ( ITEM# ) ; 
TOTAL  = SUM(ITEM  SUM); 


Figure  4.5.  Some  examples  of  Assertions, 


FOR  I * 2 TO  NO_OF  ITEMS 

ITEM_SrJM  = OUANT  (I)  * PRICE  ( ITEM#  ( I ) ) * 0 . 9 ; 
TOTAL  * OUANT (1)  > PRICE ( ITEM #( 1 ) ) + 

SUM (ITEM  SUM); 


Figure  4.6.  Some  Additional  Examples  of  Assertions 
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Assuminq  that  the  data  name  PRICE  is  an  array 

containing  the  price  of  each  item,  and  the  data  names 
QUANT  and  ITEM!  refer  to  the  fields  described  in  Fiqure 
4.2,  then  the  two  assertions  in  Fiqure  4.5  compute  the 
total  price  (in  TOTAL)  of  all  items  in  the  record  INREC. 

The  assertions  in  Figure  4.6  also  perform  the  same 
function;  however,  a rebate  of  10%  is  applied  to  all  items 
except  the  first. 

The  data  name  ITEM_SUM  used  in  Figures  4.5  and  4.6  is 
assumed  to  be  a temporary  variable  (INTERIM)  and  no  data 
description  statement  for  that  name  is  necessary. 

However,  for  completeness  sake,  that  name  can  be  described 
just  like  a field  statement  (see  also  Figure  4.2). 

The  data  name  TOTAL  used  in  Fiqures  4.5  and  4.6  is 

assumed  to  be  a field  in  the  output  file  (SYSPRINT). 

Again,  a dd  statement  for  this  name  is  not  needed, 
however,  for  the  sake  of  completeness,  a set  of  statements 
can  be  supplied  as  in  Figure  4.2.  .1 

T 

The  first  statement  in  Figure  4.6  uses  a "for-clause" 
which  defines  the  ranqe  of  the  subscript  (I,  in  this  case) 
used  in  the  relation  that  follows  it.  If  no  range  is 
supplied,  then  the  entire  ranqe  of  the  subscript  is  used 
as  the  range.  For  example,  the  first  assertion  in  Fiqure 
4.5  is  equivalent  to: 
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FOR  I = 1 TO  EXIST. INREC  S J = 1 TO  NO_OF  ITEMS 
ITEM_SUM(I,J)  = QUANT ( I, J)  * 

PRICE  ( ITEM# (I,J) ) ; 

f 

Fortunately,  the  system  propagates  the  appropriate 
subscript  names  to  each  data  name,  and  therefore  the 
simplefied  form  of  the  data  names  can  be  used  as  shown  in 
Figures  4.5  and  4.6. 

The  EXIST. INREC  used  in  the  above  asseition  defines  the 
size  of  the  array  INREC  which  is  computed  from  the 
end-of-file  delimiter.  This  is  described  further  in  later 
chapters. 

The  above  examples  give  a simplified  description  of 
assertions  in  MODEL.  The  detailed  description  of 
assertions  and  that  of  other  related  statements  is  given 
in  section  4.5. 

I ' 

4.3  ~*eadet  Section . 

i 

The  headei  of  MODEL  specification  consists  of  four  jfr- 

statements,  module  name  statement,  source  file  statement, 
target  file  statement  and  refer  statement,  described 
below. 

4.3.1  Module  Name  Statement. 

. 

Purpose : to  give  a name  to  the  desired  module. 


'J 
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Syntax : 

<module-name-statement> : : * MODULE:  <name> 

I <name>  [ IS  ) MODULE 

Semantics : <name>  is  used  as  the  module  name.  If  no 

module  statement  exists,  then  the  name  '$MODULE ' is  given 
to  the  module. 


Example : 


MODULE:  DEPSALE 


4.3.2  Source  File  Statement . 

Pul pose : to  indicate  the  names  of  those  files  which  ate 
source  or  input  to  the  desired  module. 

Syntax : 

<source-f ile-statement>: : * <keywordl>  [ <keyword2>  ] : 

<list-of-qnames> 

I <list-of-qnames>  <is>  <keywotdl>  [ <keyword2>  1 

where : 

<keywordl>: : * SOURCE  | SOU 
<keyword2> : : * FILES  | FILE 
<list-of-qnames>: : * <qnames>  [,  <qnames>M 
<qname>::»  <name>  [ . <name>  1 

Semantics:  the  qualified  names,  given  by  <1 ist-of-qnames> 
are  source  files  to  the  module.  If  any  <qname>  in  list 
contains  two  <name>s,  then  the  first  <name>  is  the  module 
name  in  which  the  soprce  file  is  described  and  the  second 
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<name>  is  the  file  name  itself. 

If  no  file  descr iotion  statement  for  the  specified  file 
exists,  then  the  cor tespondino  file  description  is 

r 

obtained  from  the  MODEL  statement  database. 

ExamoJ^e : 

SOURCE  FILES:  DEPSALE . MASTER , TRAMS 
The  above  statement  indicates  that  the  files  named 
MASTER  and  TRANS  ate  source  files  to  the  module,  and  that 
the  file  description  of  MASTER  is  to  be  obtained  from  the 
description  of  the  module  DEPSALE  in  the  MODEL  statement 
database. 

4.3.3  Taraet  File  Statement. 


Purpose:  to  indicate  the  names  of  those  files  which  are 
tarqets  oi  outputs  of  the  desired  module. 

Syntax : 


<tarqet-f i 1 e -statement > : 


<keyword3>  [<keyword2>)  : 
<list-of-qnames> 

I <list-of-anames>  <is>  <keywotd3>  (<keyword2>) 


where : 

<keyword3>: : * TARGET  | TAR 

and  the  items  <keyword2>  and  <list-of-qnames>  has  the  same 
syntax  as  in  <source-f ile-statement>  described  in  section 

4.3.2. 

Semantics : The  qualified  names  qiven  by  <list-of-qnames> 


n 
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ate  tarqet  files  to  the  module. 

Example ; 

TARGET  FILE:  MASTER 

Note  that  a file  can  be  both  source  and  tarqet,  such  as 
a file  to  be  updated. 

If  the  source  and/or  tarqet  file  statements  were  not 
specified,  then  the  MODEL  processor  automatically  resolves 
the  files  used  in  the  module  as  source  or  tarqet,  if  any 

of  the  fields  in  that  file  is  used  as  source  or  tarqet 

respectively. 

4.3.4  Refer  Statement. 

Purpose : Adds  the  specified  sections  from  the  MODEL 

statement  data  base  to  the  MODEL  specification  for  the 
module. 

Syntax : 

<ref er-statement> : : * <keyword>:  <qualif ied-names> 
where : 

<keyword>::»  REFER  I REF 

Semantics : The  qualified  names  given  by  <oualif ied-names> 
ate  the  section  names  to  be  included  in  the  MODEL 

specification  of  the  module.  The  first  name  in  each 
qualified  name  is  the  module  name  in  which  the  section  was 
specified. 
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If  the  qualified  name  contains  only  one  name,  then  the 
name  is  assumed  to  be  the  module  name  and  all  the  MODEL 
statements  in  the  soecified  module  are  included.  Example: 

REFER:  DEPSALE. TRANS. GRP 

This  statement  includes  the  subsection  GRP  in  the 
section  TRANS  of  the  module  DEPSALE.  TRANS  can  be  any 
data  name,  and  GRP  is  one  of  its  descendants. 

Note  that  any  branch  of  the  tree  structure  representing 
a MODEL  soecif ication  can  be  referenced  be  teferrina  the 
node  in  the  tree  in  which  the  branch  starts.  For  examole, 
for  the  tree  structure  of  a MODEL  specification,  shown  in 
Figure  4.7,  the  statement: 

REFER:  DEPSALE . DATA_DESCRI PT ION . DISK .TRANS 
or  the  statement: 

REFER:  DEPSALE. TRANS ; 

can  be  used  to  refer  the  data  description  statement  of  the 
file  TRANS,  and  the  data  description  statements  of  all  its 


descendents 
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4.4  Data  Oesct i nt ion  Section. 

This  section  contains  the  followina  tyoes  of 

statements : 

(1) .  MEDIA  statement 

(2) .  FILE  statement 

(3) .  RECORD  statement 

(4) .  GROUP  statement 

(5) .  FIELD  statement 

(6) .  INTERIM  statement 

(7) .  SUBSCRIPT  statement 

These  statements  are  described  in  the  followina. 

4.4.1  MEDIA  Statement. 

Putoose:  MEDIA  statements  describe  the  media  in  which  the 

source  or  target  files  reside. 

Syntax : 

<media-statement>: : * <Iist-of-naTes>  f < i s > 1 <kevwotdl> 

[ (<arnument>  f , <araument>) *) ] 

where : 

<keywotdl>: :■  MEDIA  | MEO 

<argument>: :*  <common-arauments> 

I <taoe-arquments> 

I <terminal-ar auments> 

<common-anuments>: : » <blocksize-spec> 

I <tecotdsize-spec> 

I <orqanization-sDec> 

I <iecfm-spec> 


i 
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I <unit-spec> 

I <disp-spec> 

<tape-aiguments> : :=  <tape-label-3pec> 

I <parity-spec> 

I <chaicode-spec> 

I <tiks-spec> 

I <density-spec> 

I <int-label-spec> 

I <ext-label-SDec> 

<tei.minal-aiquments>: : * <tab-spec> 

I <paqe-spec> 

I <line-spec> 

<blocksize-soec>: :*  [<keywotd2>  [ < i s > 1 1 <integer> 

<keyword2> : : * BLOCKSIZE  | 3LKSIZE  | BS 

<recoidsize-spec>: : - (<keyword3>  [<is>]]  <integer> 

<keyword3> : : * RL  I RECORDSIZE  | RRCSIZE  | LRECL 

<oiganization-spec>: :*  {<keywotd4>  [<is>]]  <otq-type> 

<keywoid4>:  ;*  ORG  | ORGANIZATION! 

<org-tyoe> j : * SAN  | SEQ  I SEOUENTIAL 

I ISAM  | INDEXED  SEQUENTIAL  | INDEXED  | IS 
I DIRECT  I REGIONAL  | REG  | DIR  | REG1 
I REG2 
I REG3 

<recf m-spec> : : * {<keywoid5>  [ < i s > ) ] <recfm-type> 

<keywot d5> s : * RECFM  | RECORD^ FORMAT  | RF 

<recfm-type> ; s ■ FIXED  | F 

I FIXED  BLOCKED  I FB 
I VARIABLE  | V 
I VB  | VAR  BLOCKED 
I VAR  SPANNED  | VS 
I UNDEFINED  | U 

<unit-spec>s :■  {<keyword6>  (<is>11  <unit-type> 
<keyword6>  UNIT  | DEVICE 


< unit-tvpe> : : = PUNCH 
| CARD 

I TAPE  | 3400-4 
I 3400-1 
I DISK  | 3330 
I 3330-1 
I 2314 
I 2311 
I 2305 

I TERMINAL  | SYSOUT  | PRINT  | SYSPPIMT  |SYSIN 
I OLS 

<d isp-soec> : : = [<keywoi:d7>  f < i s > 1 J <disn-type> 

<keywoid7>: :=  DISP  | DISPOSITION 

<diso-type> : : = OLD  | SHR  | NEW  | MOD 

<tape-label-spec> : : = <keyword8>  [<is>]  <label-type> 

<keyword8> : : = TLABEL  | TAPE_LAREL  I TL 

<label-tyoe> : : = SL  | IBM_STD  | ANSI  STD 
! NL  | NONE 
I BLD  I PYPASS 

<pai ity-spec> : : = <keyword9>  f<is>]  <pat ity-tvop> 

<keywoi  d9>  s : * PARITY  | PAF: 

<pai ity-type> : : = ODD  I EVEN 

<charcode-3pec> : : * <keywoidl0>  f < i s > 1 <cc-type> 

<keywotdl0> : ; * CHAR  CODE  | CC  I CODE 


<cc-type> : : * EBCDIC  1 BCD  I ASCII 


<paqe-spec> ; : = 

<keywoidll> 

r < x s > i 

<inteqei > 

<keywoidll> j : = 

PAGESIZE  | 

P 1 LIN 

2S_PEP  PACE 

<1  ine-spce> : : * 

<keywordl2> 

f < is  > 1 

<inteqei > 

<keywovdl2>  s : * 

LINES1ZE  | 

WIDTH  | 

PAGE_WIDTH  | 

I LINE_WIDTH 

<tiks-soec>:  :* 

<kevwoidl3> 

[<is>l 

<trk-no> 
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<keywotdl3>:  :*  TRKS  | NO  TRKS  | TRACKS 

i 

<tr k-no> : : = 7 | 9 

<density-spec>: : = <keywordl4>  ( < i s > J <density-type> 

<keywotdl4> : : * DEN  | DENSITY 
<density-type> : : * 200  I 556  | 800  I 1600 
<int-label-spec>: : = <kevwordl5>  [<is>]  <name>  [,<name>]* 

<keywotdl5>  s : * IL  | INT_NAME  | I LABEL 

<ext-label-spec> : : = <keywordl6>  [ < i s > 1 <name>  [,<name>)* 

<keywordl6> : : = EL  | VOLSER  | VOLUME  | EXTNAME  | ELABEL 
<tab-spec> : : = <keywordl7>  [<is>J  <tab-Daims> 

<keywoidl7> : : ■ TAB 

<tab-patms> : : * (<inteqei>  f , < inteqet > 1 * ) 

Semantics : The  list  of  names  qiven  by  <1 ist-of-names> 
specifies  the  names  of  the  storage  medium,  in  which  the 
input/outout  files  reside.  The  aiouments  soecify  the 
attributes  of  the  media.  Thouqh,  some  of  the  arquments 
are  associated  with  a particular  file  in  the  media,  the 
attributes  that  are  common  to  all  files  in  the  media  can 

i 

be  specified  in  the  media  statement  itself.  In  the 
followinq,  a short  description  of  each  arqument  is  qiven. 

<blocksize-spec> 8 specifies  the  block  size  which  is 
equal  to  the  sum  of: 

(1) .  the  lenqths  of  all  records  in  the  block 

(2) .  any  control  bytes  required. 
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The  relationship  of  the  block  size  to  the  record  ifjiqth 
depends  on  the  record  format.  For  F3  format,  the 

i 

blocksize  must  be  a multiple  of  record  lenqth;  for  VB 
format,  the  block  size  must  he  equal  to  or  qteater  than  4 
+ the  lengths  of  all  records  in  the  block.  The  maximum 
value  of  the  blocksize  is  32760.  If  this  argument  is  not 
specified,  then  the  blocksize  is  assumed  to  be  the  value 
given  by  the  following  expression: 

recordsize  * max ( 1 , 6400/recordsize) 

<teco rdsize-spec>:  specifies  the  record  lenqth.  This 
is  the  sum  of: 

(1).  the  lenqth  required  for  data.  For  variable  lenqth 
and  undefined  length  records,  this  is  the  maximum 
record  lenqth; 

I (2).  4 in  the  case  of  variable  lenqth  records,  0 

otherwise. 

The  maximum  record  lenqth  is  32756  in  the  case  of  variable 
lenqth  records,  32760,  otherwise.  If  this  argument  is  not 
soecified,  then  the  record  lenqth  is  assumed  to  be  the 
I computed  lenqth  of  record. 

Cot<ganization-spec>:  specifies  the  record  organization 

in  the  file.  Three  types  of  record  organization  are 

t 

allowed : 

* 

(1)  Sequential 
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(2)  Indexed  sequential  (or  indexed) 

(3)  Reqional  (or  direct) 

There  are  three  sub-types  in  the  Reqional  type  of 
organization.  These  sub  types  are  indicated  as  REG1 , REG2 
and  REG3  (cor resoondinq  to  the  three  PL/1  types: 
Regional (1),  Regional (2)  and  Reqional (3)  respectively). 
Note  that  the  types  (2)  and  (3)  are  allowed  only  for  disk 
files.  If  this  arqument  is  not  specified,  then  the 
organization  is  assumed  to  be  SAM. 

<tecfm-spec>:  specifies  the  record  format  of  the  file. 
If  this  argument  is  not  specified,  then  the  record  format 
is  assumed  to  be  FIXED,  if  blocksize  * recordsize,  FB,  if 
Blocksize  * n*Recordsize  (where  n is  an  inteqer) 
UNDEFINED,  otherwise. 

<unit-spec>:  specifies  the  device  name  of  the  media. 
If  this  argument  is  not  specified,  then  UNIT  « OLS  is 
assumed . 

<disp-spec>:  specifies  the  disposition  of  the  data  set. 
If  this  argument  is  not  specified,  then  DISP  « OLD  is 
assumed . 

< tape- la be 1 -spec >:  specifies  the  type  of  label  of  a 
tape.  If  standard  label  tape,  then  the  internal  label 
name  must  be  specified  (see  the  description  of 
<int-label-spec>  below).  BYPASS  (or  BLP)  means  that  tape 
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label  processing  is  to  be  byoassed;  NONE  (or  NL)  denotes 

( 

no  tape  label. 

If  this  arqument  is  not  specified,  then  TL  = SL  is 
assumed. 

< 2a 1 i ty-soec > .•  <tr ks-spec>  and  <dens ity-soec>  specify 
parity,  number  of  tracks  and  density  of  the  taoe 
respectively.  If  any  one  of  these  arguments  is  not 
specified,  then  correspondingly,  PARITY  * 0D0,  TRKS  = 9, 
or  DENSITY  = 1600  is  assumed. 

<int-label-spec> : specifies  the  internal  label (s)  of 

the  taoe(s)  if  thev  ate  standard  labelled.  Each  label 
name  (<name>  in  <int-label-soec>)  should  be  6 characters 
long.  The  number  of  <name>s  must  be  eaual  to  the  numbeL 

i 

of  tapes  that  the  file  uses.  If  this  aroument  is  not 
specified,  then  the  internal  label  is  assumed  to  be  the 


same  as  the  external  label. 

<ext-label-spec>:  soecifies  the  external  label (s)  of 

the  tape(s).  Each  label  name  should  be  6 characters  long. 
The  number  of  <name>s  must  be  equal  to  the  number  of  tapes 
that  the  file  uses.  If  this  argument  is  not  specified, 
then  EL  * XXKXXX  is  assumed. 

<charcode-spec>:  specifies  the  character  code  of  the 


data.  If  this  argument  is  not  soecified,  then  CODE  « 
EBCDIC  is  assumed. 
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<page-soec> : specifies  the  page  size  (that  is,  number 
of  lines  per  page)  for  the  print  files.  If  the  specified 
size  is  > 66  then  a warning  messaqe  is  reported.  If  this 
argument  is  not  soecified,  then  PAGE_SIZE  = 60  is  assumed. 

<1 ine-spec> ; specifies  the  maximum  length  of  a line  for 
the  print  files.  If  the  specified  length  is  > 133,  then  a 
warning  messaqe  is  reoorted.  If  this  atqument  is  not 
specified,  then  line  size  is  assumed  to  be  the  same  as  the 
record  length. 

' <tab-spec> : specifies  the  tab  positions  for  the  print 
files.  At  most  10  tabs  can  be  specified  and  the  values 
must  be  in  ascending  order,  the  maximum  being  not  greater 
than  133. 

i 

Example 1 : 

INVDISK  IS  MEDIA(ISAM, DISK, FIXED, 180) 

The  above  statement  is  the  same  as  the  following 
statements 

INVDISK  IS  MEDIA (ORG-ISAM,  UNIT«DISK,  RECFM  * FIXED, 
BLOCKSIZE  » 180) 

Example2 : 

TRANS  IS  MEDIA (TAPE , 3200,  80,EL«X00010) 

The  above  statement  specifies  a standard  label  tape  (with 
external  label  » X00010),  the  descendant  files  of  which 
have  blocksize  of  3200  and  recordsize  of  80. 
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4.4.2  FILE  Statement. 

» 

Pul  pose : To  desciibe  a file  and  sor^e  of  its  attiibutes. 


Syntax : 

< f ile-statement > : : * <1 ist-of-onames>  [<is>] 

<keywoidl>  [(<paient-n3n'p>  [ , <at qiment>  1 * ) ] 

whet e : 


<1 ist-of-ananes>  has  the  sane  syntax  a3  the  con  espond  inq  1 

syntax  in  <souice-f ile-statenent>  and,  * 

<keywoidl>! :*  FILE  | REPORT  I FIL  | REP 

<atqument> : : ■ <common-atqu;nents> 

I <tape-ai quments> 

I <disk-aiqunepts> 

I <teiminal-ai quments> 

<common-arquments>  s : * <blocksize-spec> 

I <iecoidsize-spec> 

I <orqanization-soec> 

I <iecf m-spec> 

I <unit-spec> 

I <disp-soec>  f ^ 

I <key-soec> 

I <dsn-spec> 

<tape-aiqu,Tients> : : * <tape-lahel-spec> 

I <oar ity-soec> 

I <chatcode-sr>ec> 

I <tiks-spec> 

I <density-soec> 

I <int-label-spec> 

I <ext-label-spec> 

I <stattf ile-soec> 

<teiminal-aiquTents>: ;«  <tab-spec> 

I <Daqe-soec> 

I <line-spec> 

<disk-aiqunents>s : * <space-soec> 
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The  arquments  <stai t f i le-spec> , <key-soec>,  <dsn-spec>  and 
<space-SDec>  have  the  following  svntax: 

<staL t-f ile-soec>  : : = <keyword2>  f < i s > ] <inteqer> 

<keywotd2> : : = FILE#  | START_FILE 
<key-spec> : : = KEY  (<is>l  <qualif ied-name> 

<dsn-spec> : : = <keyword3>  (<is>]  <dsn-itself> 

<keyword3> : : = DSN  | DSNAME  | DATASET  | FILE_NAME 
<dsn-itself  > : : = <dsn-name>  [ (<member >) 1 
<membei > ::=  <name>  | f<siqn>]  <integet> 

<sign> : : = + I - 

<space-spec> : : * <keyword4>  ( < i s > 1 <soace-pars> 

<keywot d4> : : = SPACE 

<space-pats>: :=  ( <units>  ( , [ <inteqer > ) 

( , (cinteqet >]  [,<inteqer>) ]]  ([,) 

<rlse>]  ) 

<units>: : * TRACKS  I TRACK  | TRK 
I CYL  | CYLS 
I BLOCKS  | BLK 
I <integer> 

<r lse> ; : * R | RLSE  | RELEASE  I REL 

The  other  terms  in  <tape-arouments> , <common-arguments> 
and  <teiminal-arquments>  have  the  same  syntax  as  the 
corresponding  arquments  in  the  media-statement  described 
in  section  4.4. 

Semantics ; The  list  of  names  qiven  by  <1 ist-of-names> 
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specifies  the  names  of  the  inout/outDUt  files  used  in  the 
specification.  Each  <qname>  in  the  <list-of-qnames>  can 
contain  oneot  two  names.  In  the  former  case,  the  only 
name  in  <aname>  is  the  file  name;  and  in  the  latter  case, 
the  first  name  in  <qname>  is  the  name  of  the  media  (which 
is  the  parent  of  file)  and  the  second  name  is  the  file 
name  itself.  If  <qname>  contains  two  names,  then 

<parent-name>  shold  not  be  specified  (see  the  syntax  of  ^ 

file  statement)  in  the  argument;  otherwise,  the  oarent  . 

name  specifies  the  nameof  the  parent  media.  The  file  ‘ 

names  ate  analoaous  to  the  file  names  in  PL/1,  and  should 

not  be  confused  with  the  data  set  name.  The  data  set  name 

that  the  file  represents  is  given  by  the  arqument 

<dsn-spec>,  which  is  described  below.  Th*>  arnuxents  that 

are  common  for  media  and  file  statements  have  the 

semantics  descr  ibed  in  section  4.4.  The  semantics  of  the  t •" 

arguments  <star tf i le-soec> , <key“Soec>,  <dsn-spec>  and 
<space-SDec>  are  described  briefly  in  the  followinq: 

<star tf ile-soec> : soecifies  the  file  number  in  the  taoe  ‘ 

in  which  the  file  resides.  If  this  argument  is  not 
specified,  then  FILE!  * 1 is  assumed. 

<key-soec> : specifies  the  key  name  (which  is  given  by 
<qualif ied-name>  in  <key-soec>)  associated  with  the  file. 

If  the  file  name  is  sequential  file,  then  the  key  name 
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must  be  one  of  the  fields  in  the  file;  otherwise,  the  key 
name  can  be  any  interim  or  source  data  name  used  in 
specifyinq  the  module.  If  this  argument  is  not  specified, 
and  the  file  is  an  indexed  file,  then  the  first  field  name 
in  the  file  is  assumed  to  be  the  key  name. 

<dsn-spec>:  specifies  the  data  set  name  (given  by 

<dsn-name>).  The  dsn-name  consists  of  names  delimited  by 
the  character  the  syntax  of  which  is  given  below; 

<dsn-name> : : = <name> I . <name> ] * 

Each  <name>  in  <dsn-name>  can  be  at  most  8 characters 
lonq,  and  the  total  length  of  <dsn-name>,  including 
delimiters  is  44.  The  data  set  name  can  also  contain  a 
member  name  (given  bv  <name>  in  <member>>  if  the  data  set 
is  a Partitioned  data  set,  or  a generation  number  (given 
by  <integer>  in  <memher>)  in  the  case  of  a generation  data 
set.  The  member  name  or  the  generation  number  must  be 
enclosed  in  parenthesis  as  shown  in  the  syntax  of 

<dsn-itself >.  The  length  of  member  name  is  also  limited 

to  8 characters.  The  following  gives  a list  of  valid  data 
set  names; 

ABC. TRANS 
ABC. TRANS (JULY) 

ABC .TRANS (-1 ) 

If  this  argument  is  not  specified,  then  the  file  name 
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itself  is  used  as  the  data  set  name. 

<space-spec> : specifies  the  amount  of  space  to  be 

allocated.  This  arqument  is  allowed  only  if  the  file  is 

f 

an  output  file  with  disDosition  (DISP)  = NEW.  The  <units> 
in  the  <space-pats>  specifies  the  basic  unit  of  space 
allocation,  which  can  be  tracks,  cylinders,  or  an  integer 
which  indicates  the  numher  of  bytes.  If  RLK  (or  BLOCKS) 
is  specified,  then  block  size  is  used  for  the  <units>. 

The  first  <inteqer>  in  <space-pats>  indicates  the  number 
of  units  of  ptimaiy  allocation;  whereas  the  second  and 
third  inteqets  indicate  secondary  allocation  amount  and 
the  number  of  units  to  be  allocated  for  the  directory  (in 
the  case  of  partitioned  data  set).  The  last  term  <rlse> 
indicates  to  release  the  unused  space  inthe  data  set  at 
the  end  of  iob. 

If  this  arqument  is  omitted,  then  SPACE* (TRK , 10 , 1 , 1 ) is 
assumed.  . 

I * P 

Example : j 

INVDISK. INVEN  IS  FI LE ( DSN-OLD . INVEN , KEY=STOCK# ) 

This  statement  specifies  the  file  name  INVEN  (the  parent 
of  which  is  the  media,  INVDISK),  which  represents  the  data 
set  name  OLD. INVEN  and  the  key  name  of  the  file  is  the 
field  STOCK#. 

. 

Note  that  if  some  attributes  of  a file  is  specified  in 
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both  media  and  file  statements,  then  the  cor respondinq 
attribute  in  the  filr-  statement  is  used.  For  example,  the 
following  two  statements: 

DISKI  IS  MEDIA ( SAM , BLOCKS I ZE=6 4 00 ) 

FI  LEI  IS  FILE(DISK1 , BLOCKS I ZE* 3 2 00) 
are  equivalent  to  the  followinq  two  statements: 

DISK 1 IS  MEDIA (SAM) 

FILE1  IS  FILE (DISK 1 , BLOCKSIZE=3200) 

4.4.3  RECORD  Statement . 

Purpose:  to  describe  the  record  of  a file  and  its 

attributes. 

Syntax : 

<recor d-statemert> : : * <1 ist-of-dd-names>  (<is>] 

<keywordl>  [ (<parent-name>  [ , <arqument>) *) ] 

where : 

<keywotdl>: :*  RFCORD  | REC 

<argument> : : * <occ-desc>  I (<occ-desc>)  I <key-spec> 
<occ-desc> : : * <occ-list>  [ , <occ-list> 1 * 

<occ-list>: :*  (<occ> : 1 <occ> 

<occ>::«  * I <inteqer>  I <name> 

Semantics : The  name  in  each  <dd-name>  in  the 

<list-of-dd-names>  soecifies  a record  name,  the  attributes 
of  which  are  given  by  the  rest  of  the  statement.  The 


1 


patent  name  of  the  record  can  be  specified  either  by 
prefixing  the  parent  name  in  <dd-name>  itself,  or  by  the 
fiist  argument,  <oarent-name>. 

r 

The  <occ-desc>  specifies  the  size  and  dimension  of  the 
record.  The  dimension  of  the  record  is  aiven  by  the 
number  of  <occ-list>  in  <occ-desc>.  The  first  <occ>  in 
each  <occ-list>  indicates  the  lower  bound  of  the 
corresponding  dimension,  and  the  second  <occ>  indicates 
the  upper  bound  of  the  corresponding  dimension.  If  the 
first  <occ>  is  omitted,  then  1 is  assumed.  If  is  used 

to  specify  the  number  of  repetitions,  then  the  number  of 
repetitions  of  the  record  is  determined  by  the  end-of-file 
delimiter.  If  the  arqu.ment  <occ-desc>  was  not  specified 
at  all,  then  the  number  of  repetitions  is  assumed  to  be 

The  argument  <key-spec>  specifies  the  key-name,  if  any, 
associated  with  the  record.  The  semantics  of  this 
arqument  is  the  same  as  the  corresponding  argument  in 
<f ile-statement>. 

Example 1 : 

REC1  IS  RECORD  (INVEN,(*)) 

This  statement  describes  the  record,  REC1,  which  is  a 
descendant  of  a data  name  INVEN.  The  number  of 

♦ 

repetitions  of  the  record  is  to  be  determined  by  the 
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end-of-file  delimiter  or  by  setting  of  the  "ENDGRP" 
variable  ENDGRP. REC1  in  some  assertion. 

Example 2 s 

INVEN.REC2  IS  RLCORD(IO); 

This  record  statement  describes  the  same  record  name  as  in 
example  1,  however,  in  this  case,  the  number  of 
repetitions  of  the  record  is  fixed  and  is  equal  to  10. 

4.4.4  GROUP  Statement. 

Purpose:  To  describe  a group  in  a file  structure  (or  in  a 
record  structure)  and  all  its  attributes. 

Syntax : 

<group-statement>: :*  <1 ist-of-dd-names>  [<is>] 

<keywordl>  [ (<parent-name>  ( , <aiqument>1 *) 1 

where : 

<keywotdl>: :*  GROUP  | GRP  | GRO 

The  item  <argument>  has  the  same  syntax  as  the 

cotresoonding  item  in  RECORD  statement. 

Semantics : The  name  in  each  <dd-name>  in  the 

<list-of-dd-names>  specifies  a qioup  name,  the  attributes 
of  which  are  given  by  the  rest  of  the  statement.  The 
<argument>  has  the  semantics  as  the  corresoondinq 
<argument>  in  RECORD  statement. 
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Examplel : , 

GROUPl  IS  GROUP(INVEN.RECl, (10,20) ) 

This  statement  describes  the  attributes  of  a qrouo, 

GROUPl,  the  parent  of  which  is  qiven  by  the  qualified  name 
INVEN.REC1.  The  dimension  of  the  qrouD  is  2 and  the  size 
of  the  first  and  the  second  dimensions  ate  10  and  20 
respectively. 

4.4.5  FIELD  Statement. 

- - - - - ■■  »...  ■...!■  4 

Purpose : To  describe  a field  in  a qroup  or  a record 
structure  and  all  the  attributes  of  the  field. 

Syntax : 

<f ield-statement>: <1 ist-of-dd-names>  [<is>l 

<keywotdl>  [ (<oatent-name>  ( , <arqu.ment>]  *)  ] 

where: 

<keywotdl>: :■  FIELD  | FIE  I FLD 

<arqument> : : * <occ-desc>  I <field-desc> 

<f ield-desc>: :*  <char>  [(<lenqth>)]  [<var>] 

I BIT  ((<lenqth>)l  (<var>j 
I <bin>  [<f ixed-or-f loat>j  (<precision>) 

I <dec>  [<f ixed-or-f loat>j  [<orecision>] 

I (<dec>]  <f ixed-or-f loat>  [<ptecision>j 
I <oic>  ( '<picture> ' 1 I <num>  (<lenqth>) 

<char >: :•  CHA  | CHAR  | CHARACTER 

<lenqth>::»  * I <inteqer>  I <qualif ied-name> 
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<vaL > : : = VAR  | VARYING 

<bin> : :=  BIN  | BINARY 

<f ixed-ot -f loat> : : * FIXED  | FIX 

| FLOAT  | FLT  | FLO 

<Ptecision>: : = (Cinteaei > f , Cinteqei > ] ) 

<dec> : :=  DEC  I DECIMAL 

<oic> : : = PIC  | PICTURF 

<num>: :=  NUM  | NUMERIC 


The  item  <occ-desc>  has  the  same  syntax  as  the 
corresponding  item  in  RECORD  statement. 


Semantics : The  name  in  each  <dd-name>  in  the 
<1 ist-of-dd-names>  specifies  a field  name#  the  attiibutes 
of  which  aie  qiven  by  the  rest  of  the  statement.  The 
argument  <occ-desc>  has  the  same  semantics  as  the 
cortesoondinq  arqument  in  RECORD  statement. 

The  arqument  <field-spec>  specifies  the  tyoe  of  field 
the  name  represents.  There  ate  essentially  7 types  of 
fields : 


(1) .  CHARACTER 

(2) .  BIT 

(3) .  BINARY  (FIXED) 

(4) .  BINARY  FLOAT 

(5) .  [DECIMAL)  FIXED 

(6) .  DECIMAL  (FLOAT)  or  [DECIMAL)  FLOAT 

(7) .  PICTURE 
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The  <length>  associated  with  eithet  CHAR  type  or  BIT 
type  of  fields  soecify  either  (1)  fixed  length  string,  (2) 
dynamic  length  string,  ot  (3)  variable  length  string.  A 
data  is  a fixed  length  string  if  <integer>  is  soecified 
for  <length>.  A data  is  a dynamic  length  string  if  the 
<length>  is  specified  by  ot  by  the  aualified  name.  A 
data  is  variable  length  string  if  the  keyword  VARYING  is 
specified.  A dynamic  length  string  differs  from  variable 
length  string  in  that,  in  the  former  case,  the  length  of 
the  string  is  fixed  and  is  computed  dynamically  just 
before  the  dd-name  is  referenced.  In  the  latter  case,  the 
length  of  strina  is  variable,  however,  the  buffer  size 
allocated  for  the  string  is  the  maximum  size  specified  by 
<integer>  or  by  the  qualified  name. 

The  <ntecision>  associated  with  eithet  BINARY  or 
DECIMAL  data  specifies  the  number  of  digits  and  the  number 
of  fractional  digits  for  that  data.  For  example,  if  the 
precision  is  given  as  (10,2)  it  soecifies  10  diqits  for 
the  data,  two  out  of  which  specify  the  fractional  digits. 

The  descriotion  of  different  field  types  are  described 
further  in  the  following. 

CHARACTER  type  of  fields  specify  character  strings. 
Character  strings  includes  any  character  recognizable  by 
the  machine.  The  length  of  character  string  can  be  fixed, 
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dynamic,  or  varyinq.  If  <lenqth>  is  omitted,  then  the 
lenqth  of  the  character  strinq  is  assumed  to  be  one.  This 
field  type  defines  the  same  data  as  character  strinq  data 
in  PL/1. 

BIT  type  of  fields  sDecify  bit  strinqs.  These 
represent  strinqs  of  binary  diqits  (bits).  The  lenqth  of 
bit  strinqs  can  be  fixed,  dynamic  or  variable,  just  as  in 
the  case  of  character  strinq  data.  This  field  type 
defines  the  3ame  data  as  bit  3trinq  data  in  PL/1. 

BINARY  FIXED  field  types  reoresent  1 to  31  binary 
diqits  with  an  optional  binary  point.  <precision> 
specifies  the  number  of  diqits  in  each  field.  The  default 
precision  is  (15,0).  This  field  type  defines  the  same 
data  as  the  binary  fixed  point  data  in  PL/1. 

BINARY  fkOAT  field  types  specify  binary  floatinq  point 
data.  It  consists  of  strinq  of  diqits  with  an  optional 
exponent.  The  maximum  precision  allowed  for  this  type  of 
data  is  109j  and  the  default  precision  is  21.  This  field 

• ' 

type  defines  the  same  data  as  the  binary  floatinq  point 
data  in  PL/1. 

DECIMAL  FIXED  field  types  specify  decimal  fixed  ooint 
data.  It  can  represent  data  that  contains  1 to  15  decimal 
diqits  with  an  optional  decimal  point.  The  default 


precision,  assumed  when  no  specification  is  made,  is 
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(5,0).  This  field  type  defines  the  same  data  as  the 

( 

decimal  fixed  point  data  in  PL/1. 

DECIMAL  FLOAT  field  types  specify  decimal  floatinq 
point  data.  The  maximum  precision  allowed  is  33.  The 
default  precision  is  6.  This  field  type  defines  the  same 
field  type  as  the  decimal  floatinq  point  data  in  PL/1. 

PICTURE  field  types  specify  numeric  character  data  and 
some  special  chaiacter  string  data.  The  set  of  data  that  \ 

can  be  represented  by  picture  field  types  is  a superset  of 

the  set  of  data  that  can  be  represented  in  PL/1.  The  ; 

picture  specification  (which  is  given  by  <picture>) 

< 

consists  of  a sequence  of  character  codes  enclosed  in 
quotation  marks  which  is  part  of  the  picture  attribute. 

Picture  chaiacter  codes  ate  used  to  describe  the 
attributes  of  the  associated  data  item,  whether  it  is  the 

value  of  the  variable  or  a data  item  to  be  transmitted  ^ 

• 

between  the  proqiam  and  external  storage. 

: 

Picture  specifications  ate  of  two  types: 

It 

(1)  Character  string  soecif ication  and  \ 

(2)  Numeric  character  specification. 

A "character  stiinq"  pictured  item  is  one  that  consist 
of  alphabetic  characters,  decimal  digits,  afid  blanks. 

A "numeric  character"  pictured  item  is  one  in  which  the 
data  itself  can  consist  only  of  decimal  digits,  a decimal 


- 87  - 


point,  the  letter  E (for  exoonent),  and  optionally  a plus 
or  minus  siqn. 

There  ate  special  characters  that  can  be  used  to  edit 
either  character  strinq  data  or  numeric  character  data. 
These  characters  are  briefly  described  in  the  followinq. 
Except  for  the  picture  characters,  B,  T,  L,  and  P in 
character  strinq  data,  all  other  oicture  characters  in 
either  character  strinq  data,  or  numeric  character  data 
have  the  same  semantics  as  the  cor resoonding  oicture 
character  in  PL/1.  For  comolete  descriotion  of  those 
picture  characters,  the  reader  can  refer  to  the  PL/1 
reference  manual. 

Picture  Characters  for  Character  Strinq  Data : A picture 
specification  is  assumed  to  be  for  a character  strinq  data 
if  the  oicture  soecif ication  consists  of  at  least  one  of 
the  characters  "A"  or  "X",  or  if  it  contains  only  the 
characters  "B",  "L",  "P"  and  "T".  The  syntax  of  character 
strinq  data  can  be  specified  as  follows: 

<chat -picture-soeci f ication>: :=  <sinqle-item> 
[<sinqle-item> ] * {<oaddinq>] 

I <oaddinq> 

<sinqle-item>::=  [<reoetition>)<chars> 

<repetition>: :=  (<inteqer>) 

<chars>: : * X | A | 9 

<padding>::*  (<repetition>!  <pad-chars> 


! 
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[ [ <repetition> ] <pad-char s> 1 * 

<pad-char s> : : = ft  | L I P I T 

The  individual  oictuie  characters  used  in  the  picture 
specification  is  described  briefly  in  the  following: 

X Specifies  that  the  associated  position  can  obtain 
any  character  whose  internal  bit  conf iauration  can 
be  recognized  by  the  computer  in  use. 

A Specifies  that  the  associated  position  can  contain 
any  alphabetic  character  or  a blank  character. 

9 Specifies  that  the  associated  position  can  contain 
any  decimal  dioit  or  a blank  character. 

(3  Specifies  that  a blank  character  is  to  be  padded. 

L Specifies  that  an  end  of  line  character  is  to  be 
padded. 

P Specifies  that  an  end  of  oaqe  character  is  to  be 
padded. 

T Specifies  that  blanks  ate  to  be  nadded  till  the 
next  tab  position.  This  picture  character  is 
allowed  only  for  fields  in  a file  for  which  tabs 
were  specified  (see  also  the  description  of  the 
argument  <tab-spec>  in  MEOIA  statement).  If  no  tab 
was  specified  for  the  file,  then  this  picture 
character  is  ignored. 

Note  that  the  picture  specification  characters  9,  L,  P 
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and  T can  be  used  onlv  as  a suffix  of  any  combination  of 
A,  X and  9.  These  aie  used  specially  in  the  specification 
of  fields  in  the  print  files,  or  reports.  Some  examples 
ate  qiven  at  the  end  of  this  sub-section. 

Picture  Characters  for  Numer ic  Character  Data : The 
Dicture  characters  for  numeric  character  specification  may 
be  qtouoed  into  the  followinq  categories: 

(1)  Diqit  and  Decimal  point  soecifiers  (9  and  V). 

(2)  Zero  suppression  characters  (7.  and  *)  . 

(3)  Insertion  characters  .,  /,  and  B) . 

(4)  Siqns  and  Currency  Symbol  (S,  + , - and  $). 

(5)  Credit,  Debit  and  OvetDunch  siqns  (CR,  DP,  T,  I,  R 
and  Y). 

(6)  Exoonent  soecifiet  (K  and  E). 

(7)  Scalinq  Factor  (F). 

Since  the  above  picture  characters  have  the  same 
semantics  as  the  cotresoondinq  pictuie  character  of  PL/1, 

■ 

they  will  not  be  described  further  here.  The  descriDtion 

i 

of  these  Picture  characters  can  be  found  in  the  reference 
manual  for  PL/1.  However,  some  examples  shown  at  the  end 
of  this  sub-section  may  qive  an  overview  of  some  of  these 
picture  characters. 

The  NUMERIC  field  type  is  the  same  as  the  picture  field 
type,  the  picture  specification  of  which  contains  the 

__u 
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character  *9',  the  numbet  of  reoetition  of  which  is  qiven 

i 

by  <lenqth>.  For  example,  the  followinq  field 
specification : 

PICTURE  '999' 

is  equivalent  to  the  field  soecif ication : 

NUMERIC  (3) 

Examplel : ■ 

X IS  FIFLD(A,CHAP(10) , (20) ) 

This  statement  defines  a character  strinq  data  name,  X, 
which  is  a vector  of  20  elements,  the  lenqth  of  each 
element  beinq  ten.  The  data  name  A is  the  parent  name  of 
X. 

Example  2 : 

X IS  FIELD ( A ,BIT) 

This  statement  defines  a bit  strinq  data  name,  X,  which  is  i" 

a scalar  (dimension  * 0)  and  the  lenqth  of  strinq  is  one. 

I*- 

Example3:  j 

X IS  FIELD(A,BIN(15) ) 

This  statement  defines  a binary  fixed  ooint  data  name,  X, 
which  is  a scalar  and  the  precision  of  which  is  (15,0). 

Example 4 s 

X IS  FIELD  (A, DEC, (10,10) ) , 
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This  statement  defines  a decimal  fixed  point  data  name,  X, 
which  is  a matrix  of  size  10x10.  The  precision  of  each 
element  is  (5,0). 

Example 5 : 

X IS  FIELD  (A, PIC  'ZZV09  ') 

This  statement  defines  a numeric  character  data  name,  X, 
which  is  a scalar  and  has  a lenqth  of  4.  It  also  causes 
zero  suppression  for  the  first  two  digits  in  the  field. 
As  an  example,  if  the  value  1.23  is  assigned  to  X,  then 
the  character  representation  of  X is  criven  by:  * 2 3 * , 

where  j i represents  a blank  character.  The  picture 
character  Z is  the  zero  suppression  character,  and  the 
picture  character  V is  the  decimal  ooint  specifier.  In 
the  above  example,  the  decimal  point  is  assumed  to  precede 
the  riqht  most  two  digits. 

E x a mp 1 e 6 : 

X IS  FIELD(A, PIC '9,999 ') 

This  statement  defines  a numeric  character  data  name, 
which  is  a scalar  and  causes  an  insertion  of  a comma  if 
the  arithmetic  value  of  the  field  is  nreater  than  999.  As 
an  example,  if  the  value  1234  is  assiqned  to  X,  then  the 
character  representation  of  X is  given  by:  '1,234'. 
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4.4.6  INTERIM  Statement. 

t 

Putoose:  To  describe  the  attributes  of  interim  variable 
names.  Interim  variable  names  are  the  same  as  the  field 
names,  but  ate  not  contained  in  either  source  not  tarqet 
files. 

Syntax : 

Cintor im-statement>: :*  <list-of-dd-names>  [<is>] 

<keywotdl>  [ (<rest-of’  stmt>) 1 

where : 

<keywordl>: : = INT  | INTERIM 

<rest-of-stmt> : : * <parent-name>  [ ,<aiaument>] * 

I <aroument>  [ , <arqument> ] * 

The  item  <anument>  has  the  same  syntax  as  the 
cotiespondinq  item  in  FIELD  statement. 

Semantics:  The  arquments  of  interim  statements  have  the 
same  semantics  as  the  cor l esoond inq  arquments  in  the  field 
statements.  However,  Interim  data  names  may  not  have  any 

-ir 

parent. 

i 

Example : 

| 

TEMP  IS  INTERIM( (10, 10) , BIN  FIXED) 

| 

This  statement  defines  a binary  fixed  ooint  data  name, 

TEMP,  which  is  a matrix  of  size  10x10. 
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4.4.7  Subscript  Statement . 

f 

Purpose : To  describe  the  attributes  of  a subscriDt  name. 

Syntax : , 

<subsct ipt-statement>: : * <1 ist-of-names>  [ < i s > 1 
<keywordl>  ( ( <rest-of-stmt>) ] 

where: 

<keywordl> : : * SUB  | SUBSCRIPT 

<rest-of-stmt> : : * <parent-name>  [ , <arqument>] 

I <arqument> 

<arqument> : : * <integer>  [,[< inteqer > 1 (,  [<inteqer>l 
[ ,<inteqer > J 1 1 

Semantics:  The  names  in  <list-of-names>  specify  the 
subscript  names.  The  <par ent-name>  is  the  data  name  with 
which  the  subscript  is  associated.  The  first  three 
integers  associated  with  the  aroument  specify  the  lower 
bound,  uopet  bound  and  increment,  respectively,  associated 
with  the  subscript  name.  The  last  inteqer  specifies  the 
dimension  of  the  patent  name  with  which  the  subscript  name 
is  associated. 

The  default  values  for  the  lower  bound  and  uoper  bound 
are  the  cor resoondinq  bounds  of  the  subscripted  data  name 
specified  in  the  dd-statement.  The  default  value  for  the 
increment  is  1.  The  default  value  for  the  dimension 
(fourth  inteqer)  is  also  1. 
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Example : 

I IS  SUBSCRIPT (1,9, 2) 

This  statement  soecifies  a subset iot  name,  I,  the  tanqe  of 
which  is  qiven  by  the  set  of  indices  (1,3, 5, 7, 9).  If  an 
assertion  u:es  I as  a subscript,  and  no  " fot -clause"  is 
specified  for  that  asseition,  then  the  values  snecified  in 
the  subsetiot  statement  is  used  as  the  tanqe  fot  the 
subset ipt  in  the  assertion.  If  there  is  a fot-clause  in 
an  assertion,  then  the  tanqe  specified  in  the  fot-clause 
overrides  the  tanqe  qiven  in  the  subsetiot  statement.  Fot 
example,  the  above  subscript  statement  alonq  with  the 
followinq  assertion: 

B ( I ) * 3(1-1)  1 

is  equivalent  to  the  followinq  assertion: 

FOR  1*1  TO  9 BY  2 3(1)  = 3(1-1) 

4.4.9  Summary  of  Parameters  used  in  dd  statements. 

Table  4.1  qives  a list  of  all  parameters  (exceot 
patent-name,  tepetition  specification,  and  field 
soecif ication)  that  can  be  used  in  different  MODEL 
statements.  Second  column  in  the  table  soecifies  the 
statements  in  which  the  parameter  can  be  specified.  The 
third  column  soecifies  the  abbreviations  and  alternatives 
for  the  parametet.  Fourth  column  specifies  the  list  of 
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allowed  values  foi  the  parameter  (the  abbreviations  for 
these  values  have  not  been  shown).  Fifth  column  specifies 
the  default  values  for  the  argument.  And  finally,  the 
last  column  specifies  the  way  to  use  the  parameter,  either 
as  a positional  parameter  or  as  key-word  parameter.  If 
"EITHER"  is  specified  in  the  last  column,  either  form  can 
be  used.  Fot  example: 

ORG*ISAM 

can  be  alternately  specified  as: 

ISAM 


TAB  L t 4.1 


VABLC  4.1  (CONTI.) 
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4.5  Assertions  Sect  ion. 

This  section  contains  the  followinq  types  of 
statements : 

(1) .  Assertion  statements, 

(2) .  Source  Variable  Statements, 

(3) .  Tarqet  Variable  Statements, 

(4) .  Function  Statements, 

(5) .  Initial  statements, 

(6) .  Test  statements,  and 

(7) .  Solution  Method  statements. 

The  statement  types  (2),  (3),  (5),  (6)  and  (7)  above  are 
essentially  associated  with  the  simultaneous  equations. 
Statement  types  (2)  and  (3)  can  be  specified  even  for 
simple  assertions;  however,  they  are  not  necessary  if  the 
followinq  rule  is  satisfied; 

All  the  variables  on  the  LHS  of  an  assertion  are  the 
tarqet  variables;  and  all  variables  on  the  RHS  of 
the  assertion  (and  in  the  boolean  clause)  are  the 
source  variables. 

The  above  tvpe  of  statements  are  described  in  detail  in 
the  followinq. 

4.5.1  Assertions. 

Purpose:  To  specify  relationshios  between  data  names  used 
in  the  module  specif ication. 


Syntax : Except  for  the  <if-or-for-clause>, 


described 
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below,  the  syntax  of  an  assertion  statement  is  almost  the 
same  as  the  syntax  of  an  assignment  statement  in  PL/1. 
The  formal  syntax  of  assertions  is  given  below: 

<asser tion-statement> : : = f <oual i f ied-name> : 1 <assertion> 

<asser t ion> : : = <if-or-for-clause>  <assertion> 

f ELSE  <assertion>  ] 

I <variahle>  * <ar ith-expression> 

<if-or-for-clause>: :=  <if-or-for>  <for-expression> 

[ <then-or-let  > ) 

< i f-or-for > : : * IF  | FOR 

<for-expression>: : = <simnle-fot>  [<and-or-or> 
<for-exoression> 1 

<simple-foi >: :=  <ar ith-expression> 

[ <f rom-or-eq>  <arith-exoression> 
f <rest-of-bool > 

[ BY  <ar ith-exnression>  ]]] 

<f tom-or-eq>:  :=  FRO*1  | <tel-oDi> 

<rel-opt>::=  <l<=|>|>=|=|= 

<rest-of-bool>:  :=  TO  <ar.  ith-expression> 

I <rel-opr>  <arith-exoression> 

<and-or-or>: :*  & I I 

<then-or-let>: :*  THEN  | LET 

<at ith-expression>: : = [<mon-opr>J  <relational> 

<mon-opr  > : : * + | - | 

<relational>: :*  <orimary>  f<ooerators>  <relational>] 

<operators>: : ■ <tel-opr>  I <diadic-opr>  I <and-ot-or> 

<otimary>::»  ( <ar ith-exnression>) 

! <integer> 

I <float-data> 
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I <character -str inq> 

I <bit-strinq> 

I <vatiable> 

<var iable>: :*  <aual if ied-name>  (<subsctiots>] 

< subset ipts> : : « (<ai ith -express ion> 

[,<arith-expiession>]*) 

<diadic-opr >: : = * I / I + I - I **  I I I 

<character-str inq> : : = ' <any-str inq-of-character s>  * 

<bit-str inq> : : * ' <bits> [ <bits> ) * *B 

<bits> : : - 0 | 1 

<f loat-data> : : * [<siqn>)  <diqit-sti inq>  fE  [<siqn>l 
<stting-of-diqits> ) 

<siqn> : : * + | - 

<diqit-sti inq> : : * <str-of-diqits>  [ . [<str-of-diqits>  1 ] 

I . <str-of-diqits> 

<sti -of-diaits> : : * <digits>  [<diaits>]* 

In  the  above  BNF  soecif icat ion , the  assertion  is 
defined  recursively.  There  is  no  restriction  imoosed  on 
the  depth  of  recursion. 

Semantics : 

The  qualified  name  in  <asser tion-statement>  specifies 
the  name  of  the  assertion.  If  this  name  is  not  specified, 
then  the  system  automatically  supplies  a default  assertion 
name. 
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In  qenetal,  an  assertion  has  one  of  the  followinq 
forms : 

(1) .  <data-name>  = <exoression> 

(2) .  IF  <boolean-expression>  THEN 

<data-name>  = <exotession> 

(ELSE  <data-name>  = <exor ess  ion > ] 

(3) .  FOR  <fot-clause>  <data-name>  = <expression> 

Essentially,  an  assertion  statement  can  be  divided 
into  two  parts:  (1)  <i f-or-f or-clause>  and  (2) 

<simple-assertion>. 

i f-or-f or -clause  specifies  either  a boolean  expression, 
just  as  in  the  case  of  PL/1,  or  a set  of  subscript  ranqes 
for  the  subscripts  used  in  the  assertion.  The  oner a tors 
*1  * and  * & ' can  be  used  to  compose  compound  boolean 
expressions  ol  compound  subset iot  ranues. 

i f-or -for -clause  can  be  soocifie.-1  in  one  of  the  following 
ways : 

(1) .  A simple  variable  name.  In  this  case,  the  vaiiable 
name  must  be  defined  as  a bit  strinq  data. 

(2) .  A boolean  expression  (or  a loqical  expression)  usinq 
any  data  name  in  the  module  soecif ication. 

(3) .  Any  combination  of  (1)  and  (2)  with  the  use  of 

operators  ’I*,  's'  and  ' 

(4) .  A simple  subscriot  specification  of  the  form: 

FOR  I * 10 


I 
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(5) .  A subscript  tanqe  soecif ication  of  the  form: 

FOR  I * 1 to  u [By  cl 

(6) .  A subscript  tame  specification  of  the  form: 

FOR  1 <=  I <=  u [ BY  cl 

where  in  (4),  (5)  and  (6),  I is  a subsctiot  name,  and  1,  u 
and  c ate  the  lower  hound,  uooet  bound  and  increment, 
respectively,  associated  with  the  subscript. 

(7) .  Any  combination  of  (4),  (5)  and  (6)  with  the  use  of 
operators  ’I*  and  '&•. 

(8) .  Any  combination  of  (1)  thru  (7)  with  the  use  of  the 
operators  'I'  and  'fc'. 

The  first  three  types  in  the  above  list  specify  boolean 
expressions  associated  with  the  assertion  and  the  types 
(4),  (5),  (6)  and  (7)  soecify  the  subscript  ranges 

associated  with  the  subscripts  used  in  the  assertion. 

Some  examples  of  boolean  exoressions  are: 

A * 'NULL* 

A > 1000  | B < 20 
( A * 3 | C=*D)  & (E  = 1 ) 

Some  examples  of  for-clauses  are: 

I ■ 1 TO  9 BY  2 
1 • 1 TO  10  I I * 20  TO  30 

| 

1*1  TO  10  * J * 1 TO  5 
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Simple  Assertions  soecifv  the  relationships  between  the 
source  and  taroet  variables  used  in  the  assertion.  It  is 
in  the  form  of  an  eauation;  the  LHS  of  the  equation 
specifies  the  taiqet  variable  in  the  assertion  and  the  RHS 
specifies  the  source  variables  used  in  the  assertion.  LHS 
of  the  equation  must  be  a sinqle  data  name,  which  may  be 
subscripted.  RHS  of  the  equation  is  an  exmession 
composed  of  vai iable  names  (which  may  be  subscripted), 
constants  and  functions.  Some  of  the  built  in  functions 
like,  SUBSTR,  can  also  appear  on  the  LHS  of  the  equation. 
This  situation  is  exolained  in  later  sections. 

Some  examDles  of  simole  assertions  are: 

A = 3 + C 

A = 'THAT  IS  'll  'RIGHT' 

ARC (10)  = 100 

SUBSTR ( X , Y , 4 ) = 'JUNK* 


In  the  last  example,  the  data  name  X is  assumed  to  be 
the  taiqet  variable  and  Y as  the  source  variable. 

<character-3tring>  (in  the  production  <primarv>) 
consists  of  characters  enclosed  in  quotation  marks.  If  a 
sinqle  quotation  mark  itself  is  to  be  enclosed  in  the 
character  strinq,  then  the  quotation  mark  must  be  written 
as  two  sinqle  quotation  marks  with  no  interveninq  blanks. 
For  examole,  the  character  strinq  constant: 


105  - 


•SHAKESPEARE  ' 'S  ' * * 'HAMLET  ' ' ’ ' ' 
indicates  the  character  strinq:  SHAKESPEARE 'S  "HAMLET". 

<bit-stf ing>  consists  of  strinqs  of  diqits  0 or  1 
enclosed  in  sinqle  quotation  marks;  the  letter  B must 
follow  the  strinq  immediately. 

<f loat-data>  recoqnizes  decimal  floatinq  ooint  data 
which  consists  of  a strinq  of  diqits  (with  an  optional 
decimal  point)  and  an  optional  exponent.  Some  valid 
floatinq  ooint  data  constants  are: 

12.34E-5 
. 1 2B-10 
12.000 
12.34 

Examplel : 

ASS1 : IF  A * B THEM  C » D ELSE  C = E(X) 

This  assertion  contains  a boolean  expression  HA*B"  and  two 
simple  assertions  "C*D"  and  "OE(X)".  The  tarqet  variable 
associated  with  this  assertion  is  C and  the  source 
variables  are  A,  B,  D,  E and  X.  The  name  ASS1  is  the 
assertion  name. 

Example2 : 

FOR  I « 1 TO  10  A ( I ) « A(I-l) 

This  assertion  contains  a for-clause  soecifyinq  the  ranqe 
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of  subset iot  I used  in  the  simple  assertion  "A ( I ) =A ( 1-1 ) " . 
Soutce  variable  of  this  assertion  is  A ( I ) for  I * 0 to  9, 
and  the  tarqet  variable  of  the  assertion  is  A (I)  for  1=1 
to  10.  Assertions  with  subscripted  variables  ate 
described  in  the  later  sections. 

Exam ole 3 : 

IF  C = 3 S I = 1 TO  10  THFN  A(I)  = A(I-l) 

This  assertion  contains  a boolean  expression  "C=B"  and  a 
for-clause  "1=  1 to  10".  The  target  variable  of  this 
assertion  is  A ( I ) for  I = 1 to  10;  and  the  source 

variables  are  C,  B and  A(I)  for  I = 0 to  q. 

4.5.2  Soutce  Variable  Statements . 

Purpose;  To  specify  the  soutce  variable  names  used  in  an 
assertion.  This  statement  is  needed  onlv  in  defining  a 
set  of  simultaneous  statements,  or  when  the  rule  qiven  in 
section  4.5  is  not  applicable. 

Syntax; 

<source-var-statement> ; ; = <asse rt ion-name > : 

<keywotdl>  ; <1 ist-of-var iables> 

where : 

<keywordl>: :=  SOURCR  I SOU 

<list-of-variables>; ;■  <variable>  [ , <vat iable> 1 * 
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and  the  item  <variable>  has  the  same  syntax  as  the 
cot responding  item  in  assertion  statement  (see  sub-section 

4.5.1) . 

Semantics : The  viuiables  qiven  by  <1 ist-of-var iables>  ate 
the  source  variables  for  the  assertion  oiven  by 
<asser t ion-name> . As  in  assertion  statement,  an  assertion 
name  can  be  a qualified  name.  If  no  source  variable 
statement  is  soecified  for  an  assertion,  then  all  the 
variables  in  the  boolean  expression  and  RHS  of  the 
assertion  are  assumed  to  be  the  source  variables. 

Example : 

ASS1:  SOURCE:  A,B,C 
X « A + n + C ( 1 234 ) 

The  first  statement  in  the  above  example  specifies  that 
the  variables  A,  B and  c used  in  the  assertion  ASS1  are 
the  source  variables.  The  second  statement  specifies  the 
assertion  itself. 

4.5.3  Target  Var iable  Statements. 

Purpose:  To  3pecifv  the  tarqet  variables  used  in  an 
assertion.  This  statement  is  needed  only  in  defininq  a 
set  of  simultaneous  statements,  or  when  the  rule  qiven  in 
section  4.5  is  not  applicable. 
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.Syntax  : 

< tarqet -vat  -statement : : * <as set  t ion-name > : 
<keywordl>  : <1 ist-of-var iables> 

where : 

<keywotdl > : : = TAR  | TARGET 


and  the 

item  <li 

st-of-var 

iables> 

has  the  s 

ame  syntax 

as 

the  cone 

spondinq 

item  in 

source 

variable 

statement  ( 

see 

section  4 

.5.3)  . 

Semantics 

: The  va 

riables  q 

iven  by 

the  <list 

-of-vaj 

•iabl 

es> 

are  the 

taroet  variables 

of  the 

assertion 

, the 

name 

of 

which  is 

oiven  by 

<asset  ti 

on-name  > 

• 

If  no 

tarqet 

va liable 

statement  is  specified 

for 

an 

assertion 

, then 

all  the 

vat iabl 

es  on  th 

e IjHS 

of 

the 

assertion  ate  assumed  to  be  the  tarqet  variables. 

Example : 

ASS  1 : TARGET;  X 

This  statement  specifies  that  the  variable  X is  the  tarqet 
of  the  assertion  ASS1. 

4.5.4  Function  Statements. 

Purpose:  To  specify  the  function  names  used  in  an 
assertion,  or  in  the  model  specification.  This  statement 
is  used  only  in  the  definition  of  a set  of  simultaneous 


. 


equations 


Syntax : 

<funct  ion-statement  >:  s * f <asser tion-name>  : 1 
<keywordl>  : <1 ist-of-f unctions> 

where: 

<keywordl>: :■  FUNCTION  I FUN 

<1 ist-of-f unctions> : s * <name>  l (<arquments>) ] 

<arquments>:  :■  <field-desc>  [ , <f  ield-desc>  ] 

The  item  <field-desc>  has  the  same  syntax  as  the 
cot iespondinq  item  in  field  statement  described  in  section 
4.4.5. 

Semantics : The  names  in  the  <list-of-functions>  specify 
the  function  names  used  in  the  module  specification  (if  no 
assettion  name  was  specified),  or  in  the  particular 
assertion  name  of  which  is  qiven  by  <asser tion-name>. 

The  <arquments>  associated  with  each  function  name 
specify  the  attributes  of  the  arquments  of  the  function. 
As  in  the  case  of  PI./1,  these  arguments  are  requited  if 
the  function  is  in  a library  (that  is,  not  defined  by  the 
user  in  the  module  specification)  and  if  the  attributes  of 
the  arquments  of  the  function  differs  from  the  attributes 
specified  in  the  usaqe  of  that  function.  The  attributes 
of  an  argument  is  3ame  as  the  field  type  of  a data  name 


no  - 


which  is  described  bv  the  item  <field-desc>  in  section 

4.4.5. 

Examole : 

FUNCTION:  COMPUTE (CdAR  (4) ) 

This  statement  soecifies  the  attributes  of  the  one  and 
only  one  argument  of  the  function  COMPUTE  which  is  a 
character  string  of  length  4. 

4.5.5  Initial  Statements . 

Purpose:  This  statement  is  used  to  specify  the  initial 
values  of  the  source  variables  used  in  a set  of 
simultaneous  eauations. 

Syntax : 

<initia 1-statement >:: = <a3sert ion-name > : 

<keywordl>  : <simole-assertio.os> 

where : 

<keywordl>: :=  INIT  I INITIAL 

<simple-assertions>: : * <variable>  * <at ith-exoi ession> 

[,<vatiable>  * <ar ith-exoression>l * 

The  items  <vatiable>  and  <arith-exoression>  has  the  same 
syntax  as  the  corresponding  statements  in  assertion 

statements  described  in  sub-ection  4.5.1. 

‘ 

Semantics:  The  set  of  values  soecified  by  each 

" 
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Or  ith-expression>  in  the  Oimole-asser  tions>  SDecifles 
the  set  of  initial  values  for  the  variables  specified  on 
the  LHS  of  the  cotrespondina  simple  assertion.  The 
variables  used  in  the  LHS  of  each  simDle  assertion  must  be 
used  in  the  assertion,  the  name  of  which  is  qiven  by 
Osset  t ion -name >. 


i 


Example : 

SIM : INIT:  X0*1,  Y0*0 

The  above  statement  specifies  the  initial  values  of  1 and 
0 for  the  variables  xO  and  vO  resoectively , which  are  used 
in  the  assertion  SIM. 


4.5.6  Test  Statements. 

l 

Purpose : These  statements  are  used  to  soecifv  the  test 
values  associated  with  each  variable  in  a set  of 
simultaneous  eouations.  These  test  values  are  used  to 
terminate  the  iteration  in  the  solution  method  adopted  for 
solvinq  the  set  of  simultaneous  equations. 


Syntax : 

<test-statement>: : ■ Osser tion-name>  : 

TEST  : <simole-assertions> 

whete  oimole-asser tion>  has  the  same  syntax  as  the 
cot respondinq  term  in  initial  statement  described  in 
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section  4.5.5. 

I 

Semantics : The  <ai i th-exoression>  in  each  simole  assertion 
specify  the  test  values  foL  the  col  responding  variable  on 
the  LHS  of  the  simole  assertion. 

Example:  SIM:  TFST  : X0=  .0002 

This  example  specifies  the  test  value  of  .0002  for  the 
vaiiable  X0  foi  the  termination  of  iteLation  in  solvinq 
the  assertion  SIM. 

4.5.7  Solution  Method  Statements. 

Purpose:  These  statements  describe  the  solution  method  to 
be  used  to  solve  the  set  of  simultaneous  eouations. 

Syntax : 

<solution-stmt> : : * <asset t ion-name>  : 

<kevwordl>  : <Darameters> 

where: 

<keywoidl > : : * SOL  | SOLUTION  -y 

<paiameteis>:  :=  <na.me>  f ,<arith-expression>l  * 

r 

and  the  term  <ar ith-exoi ession>  has  the  3ame  syntax  as  the 
correspondin'?  item  in  assettion  statement  described  in 

’ 

section  4.5.1. 

Semantics:  The  name  in  <oarameters>  is  the  solution  method 


113 


i 


to  be  used  for  the  assettion,  the  name  of  which  is  qiven 
by  <asser tion-name>.  If  second  parameter  is  specified, 
then  it  specifies  the  maximum  numbet  of  iterations  to  be 
petformed  in  the  solution  method.  Only  two  parameters  is 
currently  being  used. 

Example : 

ASS:  G S,  10 

The  above  statement  specifies  that  Gauss-Seidel  solution 
method  be  used  and  the  maximum  number  of  iterations  to  be 
performed  is  10. 

4.6  Help  Statement. 

In  addition  to  the  statements  described  in  the  earlier 
sections,  one  additional  statement  is  allowed  in  MODEL. 
This  statement,  called  HELP  statement,  is  used  in  aiding 
the  user  by  disolaying  the  syntax  any  statement  in  MODEL, 
or  by  displaying  the  descriDtion  of  an  error  code  reported 
by  the  processor.  The  semantics  of  HELP  statement  is 
described  in  detail  in  the  next  chaoter.  In  the 
following,  the  purpose  and  syntax  of  the  statement  is 
qiven. 

Purpose:  To  aid  the  user  with  the  .description  of  an  error 
code  oi  the  description  and  syntax  of  a particular  MODEL 
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statement . 


Syntax : 

<help-statement> : : = HELP  [<arqument>) 
<arqument> : : = MODEL  | 

SYNTAX  | 

<statement-type>  I 
<er ror-code> 


4.7  Keywords  in  MODEL. 

Currently,  only  the  following  keywords  are  used  in 
MODEL:  ABSENT,  ^DORP,  PMIST,  LENGTH,  POINTER, 

and  SUBSET. 

All  the  above  keywords  are  used  as  the  hiqhest 
qualifier  for  a data  name,  like  in,  POINTER. MAST, 
SUBSET. REPORT,  and  so  on.  The  significance  of  each  of  the 
above  keywords  is  briefly  described  in  the  following. 

4.7.1  The  keyword  A 3 SENT. 

The  keyword  absent  is  associated  with  a file  name, 
group  name  or  a record  name.  It  is  used  to  detect  the 
"absent"  condition  of  the  data  name  that  follows  it.  For 
example,  if  MAST  is  a file  name,  then  ABSENT. MAST  is  a bit 
variable  that  is  equivalent  to  the  following  on-unit  of 
PL/1: 

UNDEFIMEDF ILF (MAST) 

Similarly,  if  RnC  is  a record  in  an  ISAM  file,  mast,  then 
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ABSENT. RFC  is  also  a bit  vaiiable  which  is  equivalent  to 
the  followinq  on-unit  of  PL/1: 

KEY (MAST) 

4.7.2  The  Keyword  ENDGRP. 

The  keyword  ENDGRP  is  used  to  terminate  an  iteration 

cot tesoonding  to  a reoeatinq  data  name.  For  examole, 

consider  the  set  of  MODEL  statements  shown  in  Fiqure  4.8. 

G1  IS  GROUP ( * ) ; 

G2  IS  GROUP (G1 , (*) ) ; 

F IS  FIELD (G2) ; 

T IS  INTERIM (*); 

IF  F ( I ) = F ( 1+1 ) THEN  ENDGRP. G2  * 'i'B; 

ELSE  ENDGRP. G2  * ’O’B; 

T « ANY ( F) ; 

Figure  4.8.  An  example  using  the  keyword  ENDGRP. 

If  the  seauence  of  elements  in  G1  is  given  by: 

10  12  12  12  13  13  16  20  20 

then,  the  atouo  G1  repeats  5 times  and  the  five  qrouos  in 
G1  ate  given  by  the  followino  sets  of  elements:  (10), 
(12,12,12),  (13,13),  (16)  and  (20,20).  T will  therefore, 
contain  the  set  of  elements:  (10,12,13,16,20). 

4.7.3  The  Keyword  ENDFILE. 

The  kevword  ENDFILE  is  associated  with  a file  name  to 
detect  end  of  that  file.  For  example,  if  MAST  is  a file, 


116  - 


then  ENDF ILF. MAST  is  a hit  variable  which  is  equivalent  to 
the  following  on-unit: 

ENOFILE (MAST) 

4. 7. 4 The  Keyword  EXIST. 

The  keyword  EXIST  is  used  to  indicate  the  size  of  a 
repeating  data  name.  For  example,  for  the  set  of 
statements  i,n  Figure  4.8,  the  user  could  explicitly  give 
an  exoression  for  EXIST. G1  or  EXIST. G2,  like: 

EXIST. G2  = 10; 

In  this  case  the  Group  is  assumed  to  he  repeating  10 
times. 

4.7.5  The  Keyword  LENGTH. 

The  keyword  LENGTH  is  used  to  indicate  the  length  of  a 
string.  For  example,  if  the  field  X is  defined  by  the 
following  statement: 

X IS  FIELD  (CH^R  (*) ) ; 

Then,  LENGTH. X can  he  used  to  indicate  the  actual  length 
of  X. 

4.7.6  The  Keyword  POINTER. 

The  keyword  POINTER  is  used  to  select  a Particular 
member  of  a repeating  group  or  record.  Most  freauent  use 
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of  this  keyword  is  to  select  a record  of  a keyed  file. 

For  examole,  if  MAST  is  a keyed  file,  with  the  key  name 
PRODUCT  NUMBER,  then  the  following  assertion: 

POINTER. MAST  * TRANSACTION . PR0DUCTNUM8ER ; 
will  select  that  record  in  file  MAST,  which  corresponds  to 
the  PROOUCT_NUMBER  in  the  file  TRANSACTION. 

4.7.7  lil?  Keyword  SUBSET. 

The  keyword  SUBSET  is  used  with  a file  name,  to  select 
a subset  of  records  in  that  file.  For  example,  if  the 
following  assertion  was  specified: 

IF  <logical-exDression>  THEN  SUBSET. MAST  * SELECTED; 

ELSE  SUBSET. MAST  = SELECTED; 

And,  if  MAST  is  a source  file,  then  a record  is  read  from 
MAST  only  if  SUBSET. MAST  is  set  in  the  above  assertion. 

However,  if  MAST  is  a taroet  file,  then  a record  is 
written  to  that  file  only  if  SUBSET. MAST  is  set  in  the 
above  assertion. 
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4.8  An  Hlustrative  ExaTole. 

i 

We  will  resort  to  an  examole  to  illustrate  the  use  of 
MODEL.  This  examole  is  taken  from  the  book  bv  McCracken 
and  Gatbassi  (McC70) . It  envisages  an  environment  of  a 
sales  organization  with  a number  of  salesmen  and  extensive 
inventory.  The  user  of  MODEL  is  envisaged  to  be  a sales 
analyst  wishing  to  soecify  the  requirement  for  sales 
reporting.  Fiqure  4.9  illustrates  in  block  diagram  form, 
the  requirement  which  is  a subset  of  the  case  study  in  the 
reference  (McC70).  The  program  module  to  be  produced  by 
the  MODEL  Processor,  as  shown  at  the  center  of  Fiqure  4.R,  y 

is  named  RIM  1.  It  will  maintain  year-to-date  sales  in  a 
file  named  MASTER.  Sales  information,  consistim  of 
PR0nUCT_NUM9ER,  QUANTITY,  SALESMEN  and  DISTRICT  is 
obtained  from  the  file  na.meed  TRANSACTION.  The  REPORT  1 
file  consists  of  monthly  sales  of  each  product.  The  case 
study  in  the  reference  includes  an  additional  reoort  which 
is  omitted  here  for  sake  of  simplicity. 

The  MODEL  specification  for  the  pronram  module  RUN  1 is 
shown  in  Figure  4.10.  The  first  three  statements  in  this 
fiqure  represent  the  header  section.  Statements  4 thru  10 
define  the  file  MASTER,  and  its  comoonents.  Statements  11 
thru  18  define  tne  file  TRANSACTION  and  its  comoonents. 

Statements  19  thru  31  define  the  file  REPORT  1 and  its 
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components.  The  subscript  name  J is  associated  with  the 
repeating  data  name  TRANS  on  statement  32.  And  finally, 
the  assertions  in  statements  31  thru  3*1  define  the 
relations  between  the  data  items. 

The  TRANSACTION  file  is  tea aided  as  consistinq  of 
qtoups  of  records.  F.ach  qrouo  corresponds  to  a particular 
PR0DUCTNUM3ER.  The  assertion  on  statement  39  indicates 
the  upoer  bound  of  repetition  for  the  group  TRANS.  In 
this  case,  the  upoer  bound  of  the  subscript  J is  equal  to 
the  number  of  transaction  records  cor respondinq  to  the 
same  PR0DUCT_NUM3ER.  This  tvoe  of  relation  is  defined  by 
the  variable  name  that  beqin  with  the  kevword  ENDGRP  (see 
also  section  4.6). 

The  function  SUN  used  in  statement  33  is  used  to  obtain 
the  sum  of  product  totals.  Note  that  SUM  is  a reduction 
function. 

The  function  ANY  used  in  statement  35  is  also  a 
reduction  function.  This  function  is  required,  because,  V 

we  need  to  read  only  one  Master  record  for  everv  qrouo  of 
transactions  (that  have  the  same  PR00UCT_NUM3EP)  . 

The  above  example  will  he  referred  in  the  discussion  of 

/ 

tne  MODEL  Processor  in  the  chapters  that  follow.  In 

i 

particular,  the  generated  reports  from  the  specification 
of  RUN  1 in  Figure  4.10  will  be  described  in  Chapter  10. 


Figure  4.  9 # An  Overview  of  a simple  Data  Processing 
Problem. 
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statement 

NUMBER  _ _ 

1 MOOJLES  RUN.lI 

_2 SOUP  CE J TRANSACT  ION, MASTER  S 

3 TARGET:  REPORT..!  .MASTER  S 

A TAPE  IS  MFDlAITAPFj; 

5 MASTER  IS  FILE!  TAPE. SAM, KEY«PROC>LCtLnUMBER»  5 

_6 MAST  IS  RECORD! MASTER  Is 

7 PROOUCT.NUMOEP  IS  F I EL!)  C "AS  T , P I C *X(  5 ) • I ! 

_ 8 _ UNI T.PR ICE  IS  FIELOIMAST.PIC '9V  9 VO 9 ' I ; 

9 Y TO.  SAL  ES  IS  F I ELl)  I MAS  I » P I C ' 9(  5 I V9V ' I S 

10  _ FILLER  IS  F I EL  0 I MAST .CHAP (6 3 ) I ; 

'll 

11  TRANSACTION  IS  F I LE I T APE , SAM , KEY  * PRODUCT  NUMBER  I 5 

12  " P°ODX  T IS  GKO  UP  I TR  AKSA  CT I ON,  I * I ) 5 

13  TRANS  IS  RECURO I PF CUUCT . I* M 5 

1 A PPOD'XT_NUMdrc  IS  F IELOI  TRANS,  PIC'X  (SI*  I ; 

15  QUANTITY  IS  F I ELDI TRANS # PI C • 999 • 1 5 

16  ' SALESMAN  IS  F I ELDI  I R ANS , PIC  * XX • I S 

17  DISTRICT  IS  FIELDITRANS.PIC* Y* | ; 

18  FILLER  IS  F I ELUI  TRANS, CHARI  69) ) ; 

19  

19  PRINTER  IS  MEDIA IPF 1 NT } < 

20  KEPOfT.l  IS  FILElPf inier.sam.ke  y*  prcduc t.nu mber i ; 

21'  "P  AGE  IS  -G»0'JP « KEPT’ RT_1)  ; 

22  HEADING  IS  RECOR  J(  PAGF)  ; 

23  PR3D'JCT_  10  IS  F ILL  01  HEA  OlNG  . PIC  • Xt  1C  I • I S 

2A PRODUCT.TOTAl  I S F 1 ELOI HEAD  I KG, P I C* X 1 1A I • I I 

"25'  ' PROOXT.YTO  IS  FIFcGlnFADING.PI C ‘XI 1AI ') I 

26  RFP.l  IS  RECORD! PAGE  ,1  * I ) ; 

"?1  F-iL  LEO l IS  F I El  0 IR FP_l  , R 1C • X I 5 I • I ; 

28 PROOX  T .NUMBER  ISFIELOIREP  l , P 1 C • X ( 5 I • I ; 

29  " FILLER  IS  FIELOIREP.t.P IC'Xl  51  • ) ; 

iO_  __  PRO  OUCT .TOT AL.F OR.MCNTH  IS  F IEL C IRE P.i , P 1C  III l 2.99  ' I J 
31'  YTD.SALLS  IS  F I ELO I REP.l  ,P  I C 'll  l ll.  99  • I I 

32  J IS  SUBSCRIPT  IT^ANS)  5 

'33  " 

33  PROOJCT.TOT  AL.F OR.MONTH  « SUM! Q JANT I T Yl J I * JNI T.PRIC E I J 

3A  TAR GET.YTD. SALES  » S0UPCE.YT3_SALFS»P°0UUCT_T0TAL_FCR_M0NTHS 

35  POINTER. MAST  « ANY  I PKQOXT  . PRO  CLCT. NUMBER  I • 

36~  PROOUCT.IO  * • PPOO'JCT  • ; 

3T  PRODUCT.TOTAl  ■ ‘MOKTH  SALES*; 

38  PROD  jCT'.VTD*  ■ • YTI)  SALE  S ' ; ” 

39  I F_ TP ANS. PRODUCT .NUMBER  I J)  -«  T R ANS . PRC OUC T.NUMUER I J *1 1 THEN  

19  ENDGRP. TRANS  - TRUE 

39 ELSE  ENOGRP. TRANS  • FALSE 


Figure  4.10.  MODEL  Specification  for  the  data 

Processing  problem  shown  in  Figure  4.9. 
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CHAPTER  5 
SYNTAX  ANALYSIS 

This  chanter  brieflv  describes  the  functions  of  the 
syntax  analvzet  in  the  MODEL  Processor.  The  Syntax 
Analysis  Proqtam  (SAP)  foi  MODEL  is  qenerated  by  the  meta 
processor  called  Syntax  Analysis  Proqtam  Generator  (SAPG) . 
The  SAPG  used  heLe  was  developed  bv  the  DDL  Pro-ject  at  the 
Moore  School,  and  was  found  invaluable  in  the  development 
of  the  MODEL  Processor.  In  the  followinq,  SAPG,  SAP  and 
the  suppoitinq  routines  used  by  SAP  are  briefly  described. 

5.1  Generation  of  SAP  usin^  SAPG  and  EBNF. 

The  overview  of  SAPG  is  shown  in  Fiqure  5.1.  As  shown, 
SAPG  produces  SAP  from  the  SDecif ication  of  the  ^ODFL 
lanquaqe  in  the  meta  lanquaae  - "Extended  Backus  Normal 
Form"  ( ERNF ) . The  EI3NF  includes  the  traditional  concepts 
of  RNF  and  the  followinq  extensions: 

(1) .  The  meta  symbols  ( and  ] ate  used  to  represent 

ontionality,  and 

(2) .  An  asterisk  immediately  followinq  M represents 

repetition  of  the  item  inside  the  brackets  zero 
or  more  times. 
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Other  Routines 
of  MODEL 


MODEL 

Statements  I 

•i 


Figure  5.1 
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Overview  of  SAPG,  SAP  and  MODEL  Processor. 
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In  addition  to  the  above  extensions,  SAPG  piovides  a 
capability  to  call  some  user  supplied  subroutines  uoon 
successful  recognition  of  a syntactic  unit.  Such 
subroutine  names  are  enclosed  in  the  EANF  itself  by 
enclosinq  them  in  slashes. 

The  subroutines  embedded  in  EBNF  are  of  three  types: 

(1) .  Recoqnizer  Routines, 

(2) .  Supnor tino  Routines, 
and  (3).  Error  Stacking  Routines. 

A routine  is  a recognizer  routine,  if  on  the  RH3  of  a 
production  in  which  that  routine  is  used  contains  only  the 
routine  and  nothing  else.  For  example,  in  the  Production: 
<!v,A'1E>  : : = /RECLAME/ 

the  routine  P.FCNAME  is  a recognizer  routine.  These 
tecoqnizer  routines  usually  recognize  different  seouences 
of  terminal  symbols.  » 

The  Supporting  Routines  embedded  in  the  E3VF  are  those 
routines  that  encode  and  store  the  statements  obtained 
from  the  user  in  the  associative  memotv. 

The  Error  Stacking  routines  arc  used  to  stack  error 
codes.  One  error  code  is  stacked  for  each  expected  token 
in  the  input.  If  an  expected  token  was  found,  the  SAP 
automatically  poos  the  error  code  in  the  too  of  the  error- 
stack.  Otherwise,  the  error  code  in  the  too  of  the  stack 
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is  used  to  inform  the  user  that  a correspondinq  token  was 
missinq  in  the  input. 

r 

It  is  difficult  to  distinquish  between  a supportinq 
routine  and  an  error  stackinq  routine  in  an  EBNF. 

However,  the  difficulty  has  been  eliminated  in  the  EBNF 
foi  MODEL  (see  section  5.8),  by  usinq  the  followinq 
convention:  the  name  of  every  error-  stackinq  routine 

starts  with  the  letter 

Because  SAPG  has  been  described  earlier  in  manv 
reports,  the  generation  of  SAP  usinq  SAPG  will  not  be 
described  further.  The  reader  intendinq  to  use  SAPG  is 
advised  to  refer  the  oriqinal  reports  (RAM73 , FRE72 , 

RIN76). 

A detailed  flowchart  of  the  syntax  analysis  phase  is 
shown  in  Figure  5.2.  Some  of  the  sections  in  this 
flowchart  are  described  in  the  followinq. 

r 
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Figure  5.2.  Syntax  Analysis  in  MODEL  Processor.  p 


- 126  - 


The  purpose  of  the  Lexical  Analyzer  is  to  scan  for 
syntactic  units  or  "tokens",  using  such  delimiters  as 
blanks  and  certain  punctuation  marks,  and  return  them  to 
the  callinq  oroqram  for  syntactic  checkinq.  The  input 
statements  are  obtained  from  the  terminal  or  from  the 
MODEL  statement  database  (if  REFER  statements  were  used). 
The  lexical  analyzer  is  also  described  in  the  earlier 
reports  (RAM73,  FRE72) . However,  one  additional  feature 
has  been  added  to  the  current  system.  This  feature  is  the 
lookahead  feature,  which  allows  a supporting  routine  to 
look  ahead  (to  a certain  extent)  of  the  current  token. 
This  look-ahead  facility  orovides  back-trackinq  in  SAPG, 
which  is  otherwise  not  possible.  To  illustrate  this 
point,  consider  the  following  three  production  rules: 

<X> : s ■ <Y>  | <Z> 

<Y> : : * AH... 

<Z > : : * A C ... 

Because  there  is  no  back-tracking  facility  in  SAPG,  the 
strings  in  <Z>  cannot  be  recoqnized  using  the  above 
productions  (see  also  the  references  RAM73,  PRE72  and 
RIN76) . However,  using  the  look-ahead  facility,  it  is 


possible  to  check  the  next  token  (while  recognizing 
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production  <Y>),  and  if  it  not  3,  then  the  production  <Y> 
is  disabled.  Therefore,  the  strings  in  <Z>  can  be 
tecoqnized.  The  function  LEXRUFN (N)  is  used  to  look-ahead 
to  the  N th  token.  To  illustrate  the  use  of  the  routine 
LEXBUFN , let  us  rewrite  the  above  three  oroductions  as 
fol lows : 

<X> : : = <Y>  | <Z> 

<Y> : : = <CHEKY>  A 3 ... 

<7, >::=  A C ... 

<CHEKY> : :=  /CHEKY/ 


The  tecoqnizet  routine  CHEKY  used  in  the  last 

production  rule  can  be  written  as  follows: 

CHEKY:  PROC  RETURNS ( B IT ( 1 ))  ; 

DCL  LEX  ENTRY; 

DCL  LEXBUFN  ENTRY (BIN  FIXED)  RETURNS (CHAR ( 3 1 ) VAR ) ; 

DCL  LEXBUFF  CHAR (31)  VAR  EXT; 

CALL  LEX; 

IF  LEXBUFF  = 'A'  THEN 

IF  LEXBUFN ( 1 ) = *R ' THEN  RETURN (' 1 '3 ) ; 

ELSE  RETURN ( '0  ’B) ; 

ELSE  RETURN ( '0  *R)  ; 

END  CHEKY; 


4.1  Er rot  Stackinn  Routines. 


One  disadvantaqe  of  SAPG  is  its  inability  to  produce 
qood  error  diaqnostics  automatically  when  an  expected 
token  is  missino  in  the  inout.  User  has  to  orovide  the 


... 
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capability  bv  embedding  the  error  stackinq  routines  in  the 
EBNF . In  qeneral,  an  error  routine  must  be  specified 
before  every  syntactic  unit.  The  function  of  the  error 
stackinq  routine  is  to  stack  an  error  code  on  the  top  of 
an  error  stack,  and  if  the  expected  token  (syntactic  unit) 
was  recognized  successfully,  then  the  SAP  simply  pops  the 
error  code  on  the  too  of  the  error  stack  and  scaninninq  is 
continued.  If  the  expected  token  was  not  found  in  the 
input,  an  error  messaqe  associated  with  the  error  code  is 
repotted  to  the  user;  and  the  statement  till  the  error  is 
discarded. 

5.4  Recognizer  Routines. 

These  routines  are  supplementary  to  SAP  and  oetforms 
syntactic  recognition  of  some  soecial  tokens  like: 

(1) .  data  names  (routine  RECNAME) , 

(2) .  Integer  and  floating  point  data  constants 

(routines  RECINT  and  RECFLOT) , 

(3) .  Charater  and  Bit  strings  (routine  LEXCSTR) , 

and  (4).  special  operators  (routines  RECOR,  RECEXt , 
etc. ) . 


These  recognizer  routines  check  for  a generic  class  of 
syntactic  units,  and  in  qeneral,  it  is  very  difficult  to 
represent  the  function  of  a lecognizet  routine  as  a 
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production  in  EBNF.  This  is  because,  the  tokens  returned 
by  the  lexical  analyzer  ate  not  single  characteLS,  but  a 
strinq  of  ch.aiacters  sepaiated  by  some  delimiter.  In  some 
cases,  it  is  impossible  to  represent  a syntactic  unit  by  a 
production  rule.  For  example,  if  we  want  to  tecoonize  the 
operatoi  'll',  and  if  we  wiite  the  followinq  deduction 
rule: 

<RECEXP>::=  || 

then,  SAPG  will  be  totally  confused  because,  it  treats  • | • 
as  a meta-symbol  used  to  separate  alternative  syntactic 
units.  Therefore,  such  a syntactic  unit  must  be 
recoqnized  usinq  a recoonizer  routine.  For  the  above 
example,  the  otoduction  rule  will  be: 

<RECEXP> : : = /RECEXP/ 

and  the  routine  UECEXP  will  look  as  follows: 

RRCEXP:  PROC  RETURN*?  (BIT  ( 1 ) ) ; 

DCL  ( LEX , LEXENA0)  ENTRY; 

DCL  LEXBUFF  CHAR (31)  VAfr  EXT; 

CALL  LEX; 

IF  LEXBUFF  * 'll'  THEM  DO; 

CALL  LEXENA0 ; 

RETURN ( * 1 *P ) ; 

END; 

ELSE  RETURN ( *0  'B) ; 

END  RECEXP; 

5.5  £ncodin^  and  Storinq  Roy1.*1]®®* 


These  routines  essentially  oerform  the  followinq 
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functions : 

(1) .  Check  local  semantics  of  different  attributes 

associated  with  a particular  statement. 

(2) .  Encode  the  user  statements. 

and  (3).  Store  the  encoded  statements  in  the  associative 
memory. 

Local  semantic  check  is  performed  on  some  tokens,  like 
checkinq  the  ranqe  of  an  integer  constant,  validating  the 
dimension  specified  in  a field,  qroup  or  interim 
statement,  and  so  on. 

Encoding  routines  encode  some  of  the  attributes  of  a 
MODEL  statement  into  an  internal  representation.  The 
encoded  data  is  stored  in  the  data  area  of  the  storage 
entry  for  the  statement.  The  different  encodings  used  for 
each  statement  is  described  in  the  next  chapter. 

The  storing  routines  are  used  to  store  the  data  names 
used  in  a MODEL  statement  (and  the  encoded  data  associated 
with  it)  in  the  associative  memory.  All  these  routines 
use  the  STORE  routine  which  is  a part  of  the  storage  and 
retrieval  sub-system  described  in  the  next  chapter. 

5.6  HELP  - A Teaching  Aid  During  Syntax  Analysis. 

In  an  effort  to  make  the  MODEL  language  usable  by  a 
novice  user,  a teaching  aid,  HELP,  is  provided  during 


m 
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syntax  analysis.  This  aid  can  be  used  to  obtain  a biief 

i 

description  of  the  syntax  of  any  statement  in  MODEL 
lanquaqe,  or  to  know  the  significance  of  an  error  message 
reported  to  the  user.  This  teaching  aid  can  be  utilized 
by  includino  a HELP  statement  in  the  input  to  SAP.  As  any 
other  statement  in  MOOEL,  HELP  statement  also  has  a proper 
syntax.  However,  HELP  statement  is  never  stoted  in  the 

associative  memory,  and  any  information  requested  by  HELP  ‘ 

statement  is  immediately  displayed  to  the  user.  The 

formal  syntax  of  HELP  statement  can  be  described  simply  by  ‘ 

the  following  production  rules: 

<help-statement>: : = H ELD  ( <ar qunent > 1 
<ar gument > : : = MODEL 
| SYNTAX 

I <statement-type> 

I <ettot-code> 

If  the  argument  is  MODEL,  then  a short  list  of  all 
allowed  statements  in  MODEL  is  displayed.  r J 

If  the  arqument  is  SYNTAX,  then  syntactical  conventions 
used  in  the  descriDtion  of  syntax  of  MODEL  statements  ate 
displayed. 

The  syntactic  unit  <statement-tyoe>  in  the  above 
production  specifies  type  of  statement,  the  syntactical 

I 

description  of  which  is  desired.  Currently, 

<statement-tyoe>  can  be  anv  one  of  the  keywords  described 

, . 


I 


in  the  following  table 

<statement-type> 

MODULE 

SOURCEF 

TARGETF 

REFER 

MEDIA 

FILE 

RECORD 

GROUP 

FIELD 

INTERIM 

SUBSCRIPT 

ASSERTIONS 
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describes  the  syntax  of 

Module  statement 
Source  File  statement 
Target  File  statement 
Refer  statement 
Media  statement 
File  statement 
Record  statement 
Groutp  statement  ( 

Field  statement 
Interim  statement 
Subscript  statement 
Assertions. 


The  syntactic  unit  <error-code>  in  the  production  rule 
of  <argument>  specifies  the  error  code,  the  description  of 
which  is  to  be  disolayed.  In  this  case,  a short 
description  of  the  error  code  is  displayed  on  the 
terminal . 

If  no  argument  is  specified  in  a HELP  statement,  then  a 
short  descriotion  of  the  error  encountered  most  recently 
is  displayed  on  the  terminal.  If  no  error  was 
encountered,  then  the  argument  is  assumed  to  be  MODEL. 

The  use  of  HELP  is  demonstrated  in  the  example  shown  in 


Fiqure  5.3.  The  listinq  for  every  use  of  HELP  command  is 
given  in  Appendix  B. 


FOLLOWING  TYPES  Of  STATEMENTS  ARE  ACCEPTER  BY  MOREL 

(1) .  MODULE  STATEMENT  (MODULE). 

(2) .  SOURCE  FILE  STATEMENT  (SOURCEF). 

<J).  TAR6ET  FILE  STATEMENT  (TAR6ETF). 
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5.7  Refen  ing  MODEL  Statements  i_n  the  Library. 

Once  a module  is  stored  in  the  library  (MODEL  statement 
database),  any  section  of  that  module  can  be  referenced  in 
subsequent  MODEL  specifications.  Such  a reference  is  done 
by  usino  the  REFER  statement,  the  syntax  of  which  is  given 
by: 

<refei -statement >:: * REFER  <module-name>  ( . <section-name>] 

where  <module-name>  is  the  name  of  the  module  and 
<section-name>  is  the  name  of  the  section  (which  may  be 
qualified)  in  the  module  which  is  to  be  referenced.  The 
organization  of  a module  into  sections  is  described  in 
chapter  7.  If  the  section  name  is  not  specified,  then  the 
whole  module  (or  member  in  the  library)  is  included  in  the 
current  specification.  Just  as  in  the  case  of  HELP 
statement,  REFER  statements  are  never  stored  in  the 
associative  memory,  but  as  soon  as  a refer  statement  is 
successfully  scanned,  a switch  is  set  in  the  lexical 
analyzer  to  read  the  input  from  the  library,  rather  than 
from  the  terminal.  When  the  reading  from  the  library  is 
complete,  the  switch  is  reset  so  as  to  obtain  inout  from 
the  terminal. 
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5.8  Complete  E3NF  (with  SRC)  for  MODEL. 

I 

In  the  followinq,  the  complete  ERNF  for  the  MODEL 
system  is  qiven.  The  routines  enclosed  in  slashes  (/)  are 
the  subroutines  that  are  called  by  SAP  while  tecoqnizinq 
the  input  statement  cor  responding  to  the  production.  A 
short  description  of  each  such  routine  is  qiven  in 
Appendix  A. 
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EBNF  FOR  MODEL 

| 

<MODEL_SPECIFICATION>: :*  [ <MODEL_BODY_STMT>/$CLRERR/] * 

/LEXFAIL/  <MODEL_SPECIFICATION> 

<MODEL_BODY_STMT> : :*  /WUREC/  /SINITNM/ 

<END_STMT>  [;] 

| <HELP>  /SVCHELP/  [<NAME>/SVSHELP/] /SVXHELP/ ( ; ] 

| <REFER_STMT>  [;]  /LIBREAD/ 

I <RSEC>  /SVSNMl/  (< NAME >/SVSNM 2/1 /5VSNM3/ 

| <MOD_  REC>  /WSCOL1/:  [ <VARl  ABLF,>/S  INCtON/1 /SSTMOD/ f ; 1 
| <S_NAME>  [<F_NAME> ) /WSCOL1/  : <V_LIST>  /SSTSRC/  [;1 
| <DDL  OR_ASSER_STMT>  ( ; ) 

<END_STMT> : : * END/ENDINPT/ 

<HELP> : : * HELP  | ? 

<REFER_STMT>: :»  <RREFER>  (<RSEC>1 
( : 1 <VAR1ABLE>  /SVKEF/ 

(,<VARIA3LE>  /SVREF/1*  > 

<RREFER> : : * REF  | REFER  | REFERENCE 
<RSEC> : : * SEC  I SECTION  | SECTIONS 

! ; 

<VARIABLE>! s-  /WVARNl/  <NAME>  /SVAV1/  [.  /WVARN2/  <NAME>  1 

/SVAV2/  ]* 

<MOD_REC> : : * MODULE  | MOD 

<S_NAME> : : * SOURCE  | TARGET  I SOU  | TAR 

<F_NAMS> s s ■ F | FIL  I FILE  | FILES 

<V  LIST> : : * <VARIABLE>  /SINC#ON/  [ , <VARIABLE>  /SINC#ON/]* 


I 
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<DDL  OR  ASSER  STMT>: :=  <VARI ASLE>  /SVANAM/ 
<DDL_OR_ASSER_BODY> 

<DDL_OR_AS3ER  30DY>::=  /WDDASS/  : <ASSSRTION_BEG>  /SVASST/ 
I [ , <VARIA3LE>  /SINC#QN/] * f<IS>]  <DATA_ OR  ASSER> 
<ASSERTION_8EG>: :=  <REC3TV>  f <RECVAR>  ] /WSCOL1/: 
<AE_LIST> 

I <RECIT>  /WSCOL2,/  : <SAS_LIST> 

I <ASSEPTION> 

<RECSTV> : : * INITIAL  I INIT  I INI  | TEST  | TES  | 

SOURCE  I SOU  I TARGET  | TAR  | 

FUNCTION  | FUN  | SOL  I SOLUTION 
<RECVAR> : : = V | VAR  | VARS  | VARIABLE  | VA°AIBALES 
<RECIT> : : = INITIAL  I INIT  I INI  | TEST  | TES 
<AE_LIST>::=  <AR  ITS  EX  PRESS  ION  > /SVAAE5/  [, 

<ARITH_EX PPF.SSION>/S VAAE6/]  * /SVAAE7/ 

<SAS_LIST>: :=  <SIMPLE  AS5ERTION>  /SVAAE5/  [, 

<SI'1PLE  _ ASSERTION >/SVA  A E6/J  * /SVAAE7/ 

< IS> s : = IS  | ARE  | = 

< ASSERT  ION  > : s =/WASS  1/  /SVAAS1/  <IF_OR_FOR  CLAtJSE> 

/SVAAS 2/  <ASSERTION>  /SVAAS3/ 

[ELSE  <ASSERTION>  /SVAAS4/) 

I <S I NPLE_ ASSERTION > /SVAAS6/ 

<IF  ORFOR  CLAUSE>: :*  <IF_OR_FOR>  <FQR  ROOL> 

[ <THEN  OR  LET> ] 
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< I F_OR_FOR > : : * IF  I FOR 

<FOR_BOOL): :*  /SVAFORl/  <ARITH_EXPRESSION>  /SVAFOR2/ 

<THBN_OR_LET>: :=  THEN  I LET 

t 

<RRELREC> : : = » I > | < I I >■  I <- 
<RECOR> : :=  /RECOR/ 

<SIMPLE_ASSERTION):  :=«  /SVASAE1/  <ARITH_EXPRESSION> 

/SVASAE2/  /WEQUAL/  - <APITH  EXPRESSION)  /SVASAE 3/ 

< AR I TH_ EXPRESSION) : : = /SVAAEl/  [<PLUS_MINUS>  /SVAAE2/] 

<OR_EX PRESS ION)  /SVAAE3/ 

<OR_EXPRESSION>: :*  /SVAOR1/  <ANO_EX PRESS  ION)  /SVATS2/ 

[<RECOR)/SVAOR3/  <OR_EXPRESSION>  /SVATS4/) 

<AND_EXPRESSION)J ;«  /SVANDl/  <RELATIONAL>  /S VATS 2/ 

[ & /SVAND3/  <ANDJ3XPRESSION>  /SVATS4/1 
<RELATIONAL>: :«  /SVAREL1/  <CATENATION>  /SVATS2/ 

[ <RRELREC>  /SVAREL3/  <RELATIONAL)  /SVATS4/ 

[ <RECTO>  <ARITH  EXPRESSION)  /SVAREL5/  ] f 

{ BY  <ARITH_EXPRESSION)  /SVAREL5/  ).  1 
CATENATION): :«  /SVACAT1/  <TE»NS)  /SVATS2/ 

KRECCAT)  /SVACAT3/  <CATFNATION)  /S VATS4/1 
<RECCAT> : : * || 

< RECTO) : : • TO 

<TERNS> s : ■ /SVATS1/  <TERM>  /SVATS2/  [ <PLUS_MINUS> 

/3 VATS 3/  <TERMS>  /S VATS 4/1 
<TERM> : : ■ /SVAT1/  <FACTOR>  /SVATS2/  [<MOPR> 


140 


/SVAT3/  <TERM>  /SVATS4/I 
<FACTOR> : : = /SVAF1/  <PRIMARY>  (<RFCEXP> 

/SVAF 3/  <FACTOR>  /SVATS4/1 

<PRIMARY> : : = /WPRIM/  ( <ARITH_EX PRESSION)  /WPARAN/  ) 
/SVAP2/ 

I <RECINT>  /SVAP3/ 

I * /SVAP31/ 

I < FLOAT  DATA)  /SVAP4/ 

I <STKING>  /SVAP5/  f <LEXf3ITR>/5VAP51/l 

I <VARIA8LE>  /SVAP6/  ( <INDICES>  ] /3VAP8/ 

<LEXQITR> : : * /LEXRITR/ 

<RECINT> : : * /RFC I NT/ 

<FLOAT_DATA> : : * /RECFLOT/ 

<STRING> : : * /LEXCSTR/ 

< INDICES) : : = (<ARITH  EXPRESS IOM>  /SVAP7/ 

[ ,<ARITH_EXPRESSION)/SVAP7/] * /WPAHAN/  ) 
<RECEXP>::=  ** 

<NOPR) : : * * | / 

<NAME> : : * /RECNAME/ 

<RECFINT> s : * /PECFINT/ 

<DATA_OR_ASSER>: :=  /WDDS/  <MSOIA  DESCRIPTION) 

I <MOO  STMT> 

I <SFILE_STMT> 


I <FILE  DESC  STMT) 
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| <REC_GRP_FLD_INT> 

l 

<MOD_STMT>  : : * <MOD__RFC>  /SSTMOD/ 

<SFILE_STMT>  j : *<S_NAME> ( <F_NAME> 1 /SSTSRC/ 

<MEDIA_DESCRIPTION>: :*  <MEOIA_REC>  /IMEDIA/[ 

<MEDIA_DESC_STMT>1  /STMEDIA/ 

<MEDIA_REC>: :■  MED  | MEDIA 

<MEDIA_DESC_STMT>: :«  ( <MEDIADE5C>  [ , <MEDI ADESC> ] */WPARAN/) 

<MEDIADESC> : : * /WMEDl/  <TAB_SPEC> 

| <SPAC£_SPEC> 

| <DSM_SPEC> 

| <KEY_W0RD1>  /WMED2/  [<IS>]  <SVOPERAMD> 

| [<KEY_rtORD2>  [ < IS  > ) 1 <SVOPERAND> 

<TAB__SPEC>  : : * <TAB_REC> [<IS>] /ISPTAB/  <TAB_PARS>/ISPTABE/ 

<TAB_REC>::«  TAP 

<TAB_PARS>: :■  /WTAP/  [(]  <RECINT>  [ [,1<RECINT> 

/ISPTABI/J*  [)] 

<SPACE  SPEOss-  <SPACE  PEC>  [<IS>1  <SPACE  PARS> 

<SPACE_RBC>: :■  SPACE  l 

<SPACE_PARS> : s ■ /WSPACE/  f(]  <RSUNIT>  /ISPUNIT/ 

f , f <RECINT>/ISPINIT/)  \,  f <RECINT>/ISPINC/] 

[,[<RECINT>  /ISPDIR/  ] 1 ] 1 
( [ , ] <RRLSE>/ISPRLSE/ J f)l 


<RSUNIT>  J s ■ TRACKS  | TRACK  | TRK  | CYL  | CYLS  | BLOCKS 
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<RRLSE> : : = R | REL  | RELEASE  | RLSF 
<OSN  SPF.C> : : = <DSN_RFC>  [<IS>]  <ns  NAME> 

<DSN  RFC> : : * DSN  | DS  | DSNAME  | DATASET  | FILF  NAME 
<DS_NAME> : : =/WDSNl/<NAME>/IMDSNF/ [ . /WDSN 2/<N AME>/IMDSNN/ ] * 

( (<MEMDER>/WPARAN/)  1 

<MEMSER> : : = /WMNAME/  <NAME>  /I MOMEM/  | [<PLUS_MINUS> 

/IMDMEM/1  <FECINT>  /IMDMEMl / < 

<PLUS  MINUS>::=  + | - 

<KEY  W0RD1 > : : = <RPAGE>  | <PLINE>  I <RNOTRKS>  | 

V ! 1 

<RSFILE>  I <ROENSTY>  | <RINTNAM>  | 

<REXTNAM>  I <RKEY> 

<RPAGE> : : = P | LINES  PER  PAGE  | PAGE  SIZE 
<RL1\'E>: : * LIMFSIZF  | WIDTH  I PACE_WIOTH  | LINE  WIDTH 
<RNOTPKS> : : = TRKS  | NO  TRKS  I NO  TRACKS  | TRACKS 
<RSFI  LE> : : = FILE#  | START  FILE 

f'*1 

<ROENSTY> : : = DENSITY  I DEN 

<RINTNAM>::=  IL  | INT  NAME  | I LAPEL 

<REXTNA.M> : : = EL  | VOL  | VOLUME  | VOLSER  | ELASEL  | 

EXT  NAME 
— 

; 

<RKEY> : S * KEY  | 

<KEY  WORD2>  s : * <RRLKNAM>  | <RRECNAM>  I <RORGN> 


I <RRECPM>  I <RUNIT>  | <RTLAHRL> 
I <RPARTY>  | <PCCOOE>  I <RDISP> 
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<RBLKNAM> : : = QS  | RLOCKSIZE  | BLKSIZE 

' I 

<RRECNAM>:  :■  RL  | LRECL  | RECSIZE  I PECORDSIZE 

<RORGN> : : = ORG  I ORGANIZATION 

<RUNIT> : : * UNIT  | DEVICE 

<RTLABEL> : : * LABEL  I TAPE_LEBEL  I TL 

<RPARTY> : : = PAR  | PARITY 

<RCCODE> : : * CHAR_CODE  | CC  | CODE 

<RDISP>::«  DISP  | DISPOSITION 

<SVOPERAND> : : * <IMORG>  | <I'JRECFM>  | <IUNIT>  | <IULABEL> 

I <IUPARTY>  | <IUCHARC>  I <IUDISP>  | <IMNOTR> 

| <IMPEN>  | <NAME>  | <INTEGER> 

<IMORG> s : * SAM  | ISAM  | IS  | SEO  ! SEQUENTIAL  | 

INDEXEDSEQUENTIAL  | DIRECT  | INDEXED  | 

REGIONAL  | REG  | DIR  I REG1  | REG2  I RFG3 
<IURECFM>:;»  FIXED  | VARIABLE  | VARSPANNED  | UNDEFINED 
I FIXEDBLOCKEO  | F | FB  I V | VS  | U | 

I VARIABLE_BLOCKEO 

< IUNIT> : : * TAPE  | 3400-4  | 3400-1  | 3330  | DISK  | 

3330-1  | 2314  | 2311  I 2305  I TERMINAL  I 
3YSOUT  | PRINT  | PUNCH  | CARD  | SYSPRINT 
SYSIN  | OLS  | EXPORT  I IMPORT  | EX  | IM 
<IULABEL>::«  SL  | IBM  STD  | ANSI  STD  | NL  | NONE  | 
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BLP  | BYPASS 
<IUPARTY> : : ■ ODD  | EVEN 
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<IUCHARC>::=  EBCDIC  | BCD  I ASCII 
<IUDISP>:  :*  OLD  | SNR  | NEW  I '100 
< IUNOTR> : : = 7 I 9 

<1MDEN>::=  200  | 556  | 800  I 1600 
<FILE_DESC_STMT>: :=<RFILE>  /IMSDIA/  [ (/TINPRNT/ 

[ ,<MEDIADESC>] * /W PA RAN/) j/STFILF/ 

<RFILE> : : = FILE  | REPORT  | INPUT  I OUTPUT  | DATASET 
<REC_GRP  FLD_  INT> : : = <FLD_NAMR>  /IFIELD/  [ <FLD_STYT> ) 

/STFLO/ 

<FLD_NAME> : : * FIELD  | FIE  | GROUP  | GRP  | GRO 

I INTERIM  I I NT  I RECORD  | REC  I SUB 
| SUBSCRIPT 

<FLD_STNT> : : * ( /TINPRNT / f f , ) <FIELD  SPFC>  1*  /WPARAN/  ) 
<FIELD  SPEC>: :=  ( <OCC_DF,SC>  /WPARAN/  ) 

) <F1ELD_TYPE>  f (/IFPRXL/<LENGTH1 >/IFFXDLN/ 

(,<PECINT>  /I FFXDPR/]  /WPARAN/  ) 

[VAR  /IFSVAR/  1 ] 

I <PIC>  [<PICrURE>l 
I <TESTPP>  <OCC  DESC> 

<TES TRP> : : * /TESTRP/ 

<OCC  0ESC>:s-  <0CC_LIST>  [ , <OCC_LIST> ] * 

<0CC  LIST>s:-<MIN  OCC>  { : <NAX  0CC>) 

<M IN_OCC> : : * /WSIZEL/  * /IFOCCS/ 

I <RECINT>  /IFOCCI/ 
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| /IFPRXE/  <VARIA8LE>  /IFOCCN/ 

<MAX_OCC>  j : * /WSIZEU/  * /IFOCCSS/ 

| <REC I NT>  /IFOCCI 1/ 

| /IFPRXE/  <VARI ABLE>  /IFOCCNN/ 

<FIELD_TYPE> : : * CHA  I CHAR  | CHARACTER 

| NUM  | NUMERIC  I BIT  | BINARY 
| BIN  | DECIMAL  | DEC  I FIXED 
<LENGTHl > : : ■ /WLENTH/  * /IFSFWM2/ 

| <RECFINT> 

| <VARIABLE>  /IFSFWMl/ 

<PIC> : s * PIC  I PICTURE 
<PICTURE>::»  * [ <STRING>  1 ' 


CHAPTER  6 

ASSOCIATIVE  MEMORY,  DICTIONARY  AND  PRECEDENCE  MATRIX 

In  this  chanter,  the  oroani zation  of  the  associative 
memory  in  MODEL  orocessor  is  described.  The  Associative 
Memory  (AM)  consists  of  Stoiaoe  and  retrieval  sub-svstem. 
Dictionary  and  the  Precedence  Matrix.  In  the  followinq, 
the  above  parts  of  the  VI  and  the  routines  used  to  create 
and  access  those  parts  are  ienctibed.  Also,  some  of  the 
relevant  data  structures  contained  in  the  AM  are 
described.  Fiqute  8.1  nives  an  overview  of  the  creation 
and  access  of  Associative  Me.motv,  Dictionary,  and  the 
Precedence  Matrix.  In  this  fiqute,  two-wav  arrows 
indicate  both  creation  and  access. 

6.1  The  Stor_ane  and  Retrieval  Subsystem. 

8.1.1  Introduction. 

Dutina  the  syntax  analysis,  each  user  statement  is 
stored  in  the  associative  memory  in  a coded  form.  This 
stored  information  can  be  later  retrieved  or  undated.  The 
subsystem  for  such  ooerations  consists  of  the  following 
routines : 
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(1)  STORE  - this  routine  stoi.es  the  source  language 
strinqs  obtained  from  the  user  statements  during  syntax 
analysis. 

(2)  RETRFVE  - this  routine  is  used  to  access  a qroup  of 
storaqe  entries  that  has  some  common  prooertv. 

In  addition  to  the  above  three  routines,  the  routines 
RETP#E,  RETRN.AM  and  RFTRPRX  have  been  implemented  to 
retrieve  some  additional  information  about  a particular 
key  in  the  memory. 

The  STORE  procedure  accents  strinqs  which  ate  formed  bv 
the  subroutines  called  during  syntax  analysis.  It  creates 
two  types  of  entries  in  the  memory: 

(1)  Storaqe  entries.  One  storage  entry  is  created 
for  each  MODEL  statement.  The  routine  STORE  is 
called  after  successful  scanning  of  each  statement. 

(2)  Directory  entries.  One  entrv  is  created  for 
each  key  name  in  the  string  heinq  stored. 

By  building  the  directory,  the  strings  are  stored 
"associatively"  in  the  sense  that  statements  can  later  be 
retrieved  based  on  their  content. 


<*.1.2  The  Directory  and  the  Storage  Structure. 

*■*..  director v consists  of  an  entry  for  each  key  name. 
••  - ‘.rectory  entrv  ooints  to  the  first  storage  entry 
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containing  that  name.  A "linked  list"  is  then  maintained 
from  the  first  storaqe  entry  with  that  key  name  to  other 
storaqe  entries  containinq  the  same  key  name.  A 

"branch  and  bound"  binary  tree  structure  is  chosen  for  the 
directory  to  make  the  modifications  and  seatchino  for  the 
key  names  efficient.  Thus,  the  first  key  name  entered  in 
the  directory  becomes  the  root  of  the  directory  tree;  the 
next  key  is  entered  "above"  or  "below"  it  in  the  tree  by 
lexicoqraphic  order;  etc.  1 

Each  directory  entry  has  the  following  format; 

I + + + 1 

I key-name  | uo-otr  I down-otr I f-ptr  I 

I + + + | 


where 

"key-name"  is  a string  of  (un  to)  .10  characters  (padded 
with  blanks)  ; 


"up-ptr"  and  "down-ptr"  are  oointers  to  other  directory 
entries,  whose  key  names  are  up  or  down,  respectively, 
in  the  lexicogr aohic  sense;  and 
"f-ptr"  is  a pointer  to  the  first  storaqe  entry  that 
contains  the  key  name. 

The  storage  entries  (the  strings  to  be  stored) 
consists  of  two  Darts: 


) 

\ 

( 

» 

1 


r 

■i 
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I 


include  the  names  the  user  orovided  in  the  MODEL 
statements  foL  naminq  data,  assertion,  etc.  These  are  the 
names  by  which  we  may  want  to  retrieve  information  later. 
(2)  Auxiliary  data  from  the  source  lanquaae  statements. 
This  data  is  in  an  encoded  form  and  is  not  used  as  basis 
for  retrievals. 

The  storaqe  entries  have  the  followinq  format: 

I— + ++ + ++ ++- + 1 


In  Idata-ptl Idptr (1) Inotr (1) | | . . . I I dotr (n) I notr (n) I 


I 


where 

Mn"  is  the  number  of  key  names  in  the  storaae  entry 
string ; 

"data-pt"  is  the  pointer  to  the  auxiliary  data  that 
contains  the  coded  r epr  esentation  of  the  source 
language  statement; 

"dptr(i)H  (i  = 1 to  n)  is  the  directory  entry  pointer  to 

the  ith  key  name;  and 

"nptr(i)M  (i  = 1 to  n)  is  the  Dointer  to  the  next  storage 


entry  which  contains  the  same  key  name 


151 


In  general,  the  name  represented  by  the  second  key  name 
in  the  storage  entry  (dptr(2))  is  the  data  name  (or  the 
assertion  name)  that  the  storage  entry  represents. 

6.1.3  Key  Names. 

The  names  supplied  by  the  user  for  naming  data, 
assertions,  etc.,  are  orefixed  by  a two  character  nrefix 
to  obtain  the  key  name  for  the  corresponding  name.  The 
prefix  identifies  the  type  of  name  that  is  being  stored. 


Foi  example. 

the 

prefix  is 

•$G'  for 

data  names 

that 

represent  groups. 

1 $A ' for 

assertion 

names,  'IE' 

for 

"exist"  names. 

and 

so  on.  The  following 

table  gives 

the 

allowed  prefixes 

and  the 

description 

of  what 

they 

represent. 

I 

I 

\ 

{ 

I 

l 


* 
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TABLE  6.1  , 


1 - 

PREFIX 

| NAME  TYPF 

$ A 

1 Assertion  name 

f A 

(Qualified  Ass.  name 

$B 

1 Subset iot  name 

$c 

1 Record  name 

$D 

|Media  name 

lExist  name 

$F 

|File  name 

SG 

IGtouo  name 

SI 

1 Interim  name 

# K 

1 Key  name 

$L 

iField  name 

$M 

1 Module  name 

$P 

1 Patent  name 

*0 

IQualified  name 

ss 

1 Source  vat.  name 

ST 

1 Tat  net  vat.  name 

1 

In  addition,  the  fitst  key  name  in  any  storage  entrv  is 
a soecial  two  character  nane  which  identifies  the  tyne  of 
statement  the  storage  entry  reoresents.  The  different 
statement  tynes  ate  given  in  the  following  table: 


f- 


i 

1 
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TABLE  6.2 


1 First  key 

1 name  in  the 
Istoraqe  entry 

1 Statement  tv^e 

1 

i $A 

1 Assertion 

1 $B 

ISubscriot  statement 

1 $c 

1 Record  statement 

1 $D 

iMedia  statement 

1 $F 

iFile  statement 

1 $G 

IGroup  statement 

1 $1 

1 Interim  statement 

1 $L 

iField  statement 

1 $M 

1 Module  statement 

1 $s 

iSource  file  statement 

1 $T 

iTarget  file  statement 

If  a data  name  supplied  by  the  usei  is  not  lonqet  than 
8 characters,  then  the  data  name  is  directly  stored  (after 
prefixing  with  an  appropriate  prefix)  in  the  directory. 
If  the  name  is  lonqet  than  8 charaters,  then  the  directory 
name  contains  a "coded  name"  which  is  4 characters  lonq. 
This  coded  name  alonq  with  the  two  character  Prefix  is 
stored  in  the  directory.  The  names  that  are  lonqet  than  8 
characters  are  saved  in  a tree  structure,  and  the  coded 
name  is  actually  the  index  of  that  name  in  the  tree 
structure.  The  maximum  allowed  lenqth  for  any  data  name 


is  31 
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6.1.4  The  Data  ^te a of  Storage  Entries;. 

Each  storage  entry  contains  a pointer  (data-pt)  to  the 
"data  area"  (auxiliary  data)  which  contains  a coded 

f 

information  of  the  MODEL  statement  that  the  storaqe  entry 
reoresents.  Most  of  the  information  that  is  contained  in 
the  data  area  is  filled  in  during  the  syntax  analysis. 

Some  of  the  entries  aie  filled  in  luring  the  creation  of 

' 

the  "dictionary"  which  is  described  in  the  later  sections. 

i 

Tne  data  area  consists  of  two  Darts:  (1) 

"common  data  area"  and  (2)  "extended  data  area"  in  the 
following  format: 


I +- — 

I common  data  area  I extended  data  area 
| + 


6. 1.4.1  The  Common  Data  Area. 


The  common  data  area 

is 

common 

to 

all 

types  of 

• 

statements.  This  contains 

5 

entries 

in 

the 

following 

format : 


r 

l 

[ 

. 


i 


comment-ptr 


user-line# 


otoc-line# 


data-list 


reserved 


where 

"comment-ptr"  is  pointer  to  a strinq  that  contains  the 
comment  associated  with  the  statement. 

"user-line#"  is  the  input  line  number  in  which  the 
statement  beqins. 

"proc-line#"  is  the  statement  number  supplied  by  the 
system. 

"data-list"  is  a pointer  to  a block  of  data  which  is 
filled  during  the  creation  of  dictionary. 

The  pointer,  data-list,  points  to  the  data  block  which 
has  the  following  formats 


I 

t 

( 

( 

t 

r 

r 

I 


whete 

"parent-ptr " is  the  storage  entry  oointer  of  the  parent 
data  name  for  this  data  name. 

"indexl"  is  the  dictionary  index  for  this  data  name. 

Mindex2"  is  a duolicate  dictionary  index  of  the  same  same 
data  name  (which  is  used  for  the  data  names  in  an 
uDdate  file) . 

"status-bits"  is  a set  of  bits  that  indicates  the  type  of 
data  name  (source  or  target).  There  are  seven  bits  in 
this  set.  The  significance  of  these  bits  is  given  by 
the  following  table: 


I 

i 

i 

j 
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indexl  I index2 


status-bits 


dtr -ptr 
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TABLE  6.3 


I Significance  I 


IDefined  as  descendant  of  a source 
I file 

IDefined  as  descendant  of  a target 
I file 

I Used  as  source  of  an  assertion 
I Used  as  target  of  an  assertion 
I Resolved  as  source  variable 
IResolved  as  target  varaible 
I Resolved  as  interim  variable 
IResolved  as  Undate  variable 


"dtr-ptr"  is  a pointer  to  a linked  list  that  contains  the 
storage  entry  pointers  of  all  the  immediate 
descendants  (daughters)  of  this  data  name.  This 
linked  list  contains  entries  in  the  following  format: 


I ste-pointer  I sis-otr 

I + 


where 

"ste-pointer"  is  a pointer  to  the  storage  entry  of  a 
daughter  name;  and 

"sis-ptr"  is  a pointer  to  a similar  structure  above,  which 
contains  the  storage  entry  pointer  of  the  next 
daughter  name. 


iStatus  bit 

I 

| SRC  BY  DOS 

I ~ ~ 

| TAR  By  DOS 

I 

|SRC_BY_ASS 
| TAR_BY  ASS 
|SRC_BY"RES 
I TAR  BY_RES 
IINT'  BY  RES 
|UPD'BY~RES 


6. 1.4. 2 The  Extended  Data  Area. 

The  Extended  data  aiea  contains  some  additional 
information  about  the  statement.  Theie  ate  thtee  tvoes  of 
extended  data  ateas.  These  thtee  tyoes  ate  used  in  the 
followinq  qtouos  of  statements: 

(1)  assettions; 

(2)  Media  and  File  statements;  and 

(3)  Record,  Gtouo,  Field,  Iotetim  and  Subscript 
statements. 

The  data  atea  of  the  Module  name  statements.  Source 
File  statements,  Tataet  File  statements  and  Refer 
statements  consists  of  only  the  common  data  atea  described 
above.  In  the  followinq,  the  thtee  tvoes  of  extended  data 
ateas  ate  desetibed  in  mote  detail. 

6. 1.4. 3 Extended  Data  Atea  for  Assettions. 

The  format  of  the  extended  data  atea  for  assertions  is 
qiven  in  the  followinq; 
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vl-ptr 

1 

i 

ass-ptr 

1 

nass-ptr 

1 

r 

sv-ptr 

i 

tv-ptr 

i 

fn-ptr 

init-ptr 

i 

test-ptr 

i 

sol-ptr 

i 

wheie 


ptr  " is  a 

pointer 

to  a list  of 

variables 

used  in 

the 

assertion. 

This 

oointer  is 

obtained 

dur inq 

the 

creation  of  the  dictionary. 

"ass-ptr"  is  a pointer  to  the  derivation  tree  of  the 
assertion.  This  derivation  tiee  is  qenerated  dutinq 
the  syntax  analysis  (see  also  the  descriDtion  of  DT  in 
section  6.4). 

"nass-ptr"  is  a pointer  to  the  data  area  or  storage  entry 
of  another  assertion  if  the  two  assertions  belona  to  a 
set  of  simultaneous  equations. 

"sv-ptr"  and  "tv-ptr"  are  pointers  to  the  source  and 
tarqet  variable  list,  resoectively , for  this 


. 
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I 

I 

( 


assertion.  This  list  is  specified  by  the  user.  i 

"fn-ptrM  is  a oointeL  to  a list  of  function  names  used  in 
the  assertion. 

"init-ptrH  is  a oointer  to  a list  of  initial  values  for 
the  variables  used  in  the  assertion. 

"test-ptr"  is  a oointer  to  a list  of  conditions  associated 
with  each  variable  for  termination  of  the  iteration  in 
the  solution  of  the  simultaneous  equations. 

"sol-ptr"  is  a pointer  to  a list  of  solution  methods  to  be 
used  (in  the  case  where  numerical  analysis  orocedures 
are  to  be  used,  such  as,  with  the  solution  of 
simultaneous  equations). 

All  the  above  oointers,  except,  ass-ptr  and  nass-ptr 
ooint  to  a dynamic  list  of  entries  havinn  the  followinq 
format: 

I +- +- +- I 

I n | ptr(l)  I ...  I otr(n)  I 

I +- + +- — I '! 

i 

where 

"n*  is  the  number  of  elements  in  the  list;  and 


ptr(i)"  (i  * 1 to  n)  is  a oointer  to  the  ith  element  in 
the  list.  This  oointer  ooints  to  the  derivation  tree 
of  an  "arithmatic  exDtession"  in  the  case  of  sv-ptr. 
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tv-ptrf  fn-ptr,  anti  sol-ptr.  It  points  to  the 
derivation  tree  of  a "simple  assertion"  in  the  case  of 
test-ptr  and  init-ptr.  It  points  to  a structure 
"var-list"  (described  later)  in  the  case  of  vl-ptr. 

The  oointer  , ass-ptr,  in  the  extended  data  area  above, 
points  to  the  derivation  tree  of  the  assertion  itself? 
whereas,  the  nointer , nass-ptr,  points  to  a structure 
which  has  the  followinq  format: 

| + + 1 

I code  | first-ptr  I next-otr  I 

I + + 1 

where 

"code"  is  'D',  'S ' or  blank  and  represents  the  tyDe  of 
nointer  specified  by  next-ptr. 

"first-ptr"  is  a storage  entry  pointer  of  the  first 
assertion  in  a set  of  simultaneous  eauations. 
"next-ptr"  is  a pointer  to  the  data  area  (if  code  = 'D') 
of  another  assertion  having  exactlv  the  same  name,  or 
it  points  to  the  storage  enttv  (if  the  code  * 'S')  of 
another  assertion  which  belong  to  a set  of 
simultaneous  equations.  If  the  code  is  blank,  then 
the  assertion  represented  by  the  current  data-area  is 
the  last  assertion  in  the  set. 


A0-A063  900  MOORE  SCHOOL  OF  ELECTRICAL  ENOINEERIN®  PHILADELPHIA  P— ETC  F/®  9/2 

VERIFICATION  ANO  CORRECTION  OF  NON-PROCEDURAL  SPECIFICATIONS  IN— ETC (U) 
SCP  7S  S K SHASTRT  N00014-76-C-0416 

UNCLASSIFIED  7S-01  NL 
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In  general,  one  storage  entry  exists  for  each  assertion  t 

name.  If  thete  exists  two  or  more  assertions  with  the 
same  assertion  name,  then  those  assertions  are  linked 

f 

together  (with  code  = 'D')  as  described  above.  Two  or 

mote  assertions  having  different  names  can  also  form  a 
simultaneous  set,  if  either,  a simple  name  in  one 
assertion  name  is  contained  in  another  assertion  name  (for 
example,  in  the  assertion  names  'A. 3'  and  'A.C',  the 
simple  name  'A'  is  contained  in  both),  or  those  assertions 
that  were  resolved  to  be  a simultaneous  set  during  the 
cycle  analysis.  In  such  cases,  the  data  areas  of  the 
assertions  that  form  the  simultaneous  set  are  linked 

* 

toqether  as  described  above,  with  code  = 'S*. 

6. 1.4. 4 Extended  Oata  Area  for  Mediai  and  Fije  Statements. 

The  format  of  the  extended  data  area  for  Media  and  File 
statements  is  given  below: 


I 

I 
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T ~§C<5CkSfz FPreCSrEIJ i z e T pa<3eS1ze|  linesTt^I  Spa£E  T 
] ill i J_2) I (2) I (2)  | (8) I 


TSTAET-TTLE ! ' TSfi — nWiJ3EtrrT5W!“TffEC PHTfftS G[  ASMS  rTVTTBlW  \ 

I _ (2)  I (20)  I (6)  I (1)  I (1)  I (1)1  (1)  I (1)1 


mssnr  nraftif  ? ■ FWJW^w"n«v=EW’=ROTBT“miirTT5  rs? — r 

I (1)  I (1)  I (1)  I (4)  | (1)  | (1) I 


I member  'nEmeT  data' set *nKm5~  “ " ~ \ 

I (8)  | (44) | 


Each  entry  in  the  above  list  represent  one  of  the 
parameter  associated  with  the  statement.  The  number  below 
each  entry  (enclosed  in  parentheses)  is  the  lenqth  (in 
bytes)  of  that  entry.  Each  parameter  of  a media  or  file 
statement  is  summai ized  in  table  4.1  of  Chapter  4. 


6. 1.4. 5 Extended  Data  Area  for  Other  Data  Statements . 

f 

The  format  of  the  extended  data  area  for  record,  field, 

; ■ 

group,  interim  and  subscript  statements  is  given  below: 


TPfeL6-Lfef5Sf?nTfCTUT^^WRTOecisioirrpmc>-tYf'EiVARvrNfl — r 

I iSl | (4)  | <2)_ I (1)  I (1/8) [ 


TPiMgreTsa  tbits  r wry  r wn  T'cgrrrr  wm  i . — r 

I (2)  I (2)  I (4) I (4)  | (4)  | (4)  | I 


In  the  above  list,  FIELD-LENGTH  specifies  the  lenqth  of 
field,  PICTURE-ATTR  represents  a pointer  to  the  Picture 
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attribute  associated  with  the  field,  PRECISION  specifies 
the  precision  associated  with  the  decimal  data,  FIELD-TYPE 
specifies  the  type  of  field  ('n'  for  binary,  *C ' for 
character,  'N*  for  numeric,  'D'  fot  decimal,  and  *T'  for 
bit),  DIMENSION  specifies  the  dimension  of  the  data  name, 
LB(i)  and  UB(i)  specify  lower  bound  and  upper  bound 
associated  with  the  size  of  the  i th  dimension.  If  a 
bound  of  a dimension  is  a data  name,  or  then  the 

cotrespondinq  bit  in  BITS  is  set.  Also,  if  the  field 
lenqth  is  varyinq,  then  the  bit  VA°YINC,  is  set.  Note  that 
the  first  four  entries  ate  allowed  for  only  the  field  and 
interim  data  names.  Also,  in  the  case  of  subscript 
statement,  UB(1),  U3(2)  and  U3(3)  snecifv  the  lower-bound, 
uorer-bound  and  increment,  respectively,  associated  with 
the  subscript  name;  and  U3(4)  specifies  the  dimension  of 
the  parent  data  name  of  the  subscript  with  which  it  is 
associated . 

6.1.5  Creatinq  and  Accessing  the  Associative  Memory. 

This  section  describes  the  different  routines  used  to 
create  and  access  the  directory  and  storaqe  entries  of  the 
associative  entries.  The  followinq  routines  are  used; 

(1) .  STORE 

(2) .  RETRFVE 

(3) .  PETF  < E 

(4) .  RETPNAM 

(5) .  RETPPPX 


6. 1.5.1  The  STORE  Procedure  (Store  Statements) . 

The  STORE (S,D)  procedure  has  two  parameters,  S and  D. 
S is  the  string  containing  the  key  names  which  are  to  be 
stored  and  to  be  entered  in  the  directory.  D is  the 
pointer  to  the  previously  built  auxiliary  data  from  the 
source  string.  The  latter  is  an  encoded  form  of  non-key 
source  lanauaqe  information. 

The  detailed  algorithm  for  the  STORE  procedure  is  qiven 
in  RIN76. 


6. 1.5. 2 The  RETPEVE  Procedure  (Petreve  Storage  Entries) . 

The  RETREVE (E ,D,S ,N, P)  is  the  procedure  for  retrieving 
desired  storage  entries,  by  searching  through  the 
associative  memory  entries.  It  has  5 parameters.  RETREVE 
finds  all  the  storage  entries  in  which  the  given  key  name 
or  expression  of  key  names,  E,  appears  and  furthermore 
checks  whether  the  first  characters  of  data  associated 
with  the  storage  entries  match  the  string  D.  That  is, 
RETREVE  finds  all  the  storage  entries  with  key  satisfying 
the  logical  expression  E and  the  other  data  0.  RETREVE 
starts  its  search  at  directory  entry  S,  normally  the  root 
of  the  directory,  and  it  returns  a list  of  pointers  p,  to 
those  entries  which  satisfy  the  request  by  the  calling 
program.  The  number  of  storage  entries  satisfying  the 
request  is  returned  in  N. 
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The  logical  expression  P user*  to  retrieve  strinqs  can 
be  any  boolean  expression  involvinq  the  keynames  in 
disjunctive  normal  form,  where  the  first  key  in  each  term 
is  non-neqated.  For  example,  the  following  call: 


CALL  RETREVE ( ’$G 


& $G  ITEMS  START,  N,P) 


will  return  in  N , all  the  storaoe  entry  pointers  which 
contain  both  the  key  names  *$G'  and  ' SGITE  MS  ' . Number  of 
such  storage  entries  is  returned  in  P. 

The  detailed  algorithm  for  the  RETRFVF  procedure  can  be 
found  in  RIM76. 

6. 1.5. 3 The  Function  RFTR* F (?.etr^Ye  I of  Entries). 

The  function  PETRIE (S)  returns  the  number  of  storaoe 

entries  in  which  the  key  name  S occurs.  Usually,  this 

routine  is  used  in  conjunction  with  the  RETRFVF  procedure 

to  obtain  the  size  of  the  array  of  oointers  (fourth 

argument)  in  which  the  reouired  storaoe  entrv  pointers  are 

to  be  returned.  The  followinq  sequence  of  ^L/I  statements 

illustrates  the  use  of  the  function  RFTP*F: 

DCL  N SIN  FIXEPj 
DCL  T (N)  PTR  CTL ; 

N « PETRIE ( 'SO ITEMS •)  ; 

ALLOCATE  T; 

CALL  RETREVE ('Sg  s,  SOITFMS  * , " , START , T , I ) ; 


FREE  T; 


Note  that  the  aroument  S of  RETRfE  can  only  be  a single 


key  name 


6. 1.5. 4 The  RETRN^M  Procedure  (Retreve  by  Name) . 

The  RETFNAM (C ,D,N)  returns  in  D,  all  the  key  names  in 
the  directory  which  match  with  the  strinq  S,  except  for 
the  first  two  character  prefix.  The  number  of  such  names 
is  returned  in  N.  For  example,  if  the  directory  contains 
the  key  names,  'SGX',  '$GY ' and  '$PX',  then  the  following 
call : 

CALL  RETRNAM ( 'SGX  ' ,D,N) 

will  return  in  D,  the  two  names,  '$GX'  and  '$PX';  and  "2" 
is  returned  in  N. 

6. 1.5. 5 The  RETRPRX  Procedure  (Retreve  by  Prefix) . 

The  RETRPRX (S,D,N)  returns  in  D,  all  the  key  names  in 
the  directory,  the  prefix  of  which  is  contained  in  the 
strinq  S.  The  number  of  names  found  is  returned  in  N.  In 
the  example  of  section  6. 1.5. 4,  the  followinq  calls 
CALL  RETRPRX(  'SG',D,N) 

will  return  in  D,  the  two  names,  '$GX'  and  ' $GY 1 ; and  "2" 


is  returned  in  N 
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6.2  The  Dictionary. 

The  dictionaiv  contains  entries  for  the  data  names  and 
assertion  names  supplied  by  the  user.  It  is  created 
during  the  semantic  analysis  of  statements  stoied  in  the 
AM  which  is  described  in  chapter  8.  In  general,  an 
assertion  name  can  have  only  one  dictionary  entry, 
whereas,  a data  name  can  have  more  than  one  entry  if  that 
data  name  is  used  with  different  subscripts.  A dictionary 
entry  has  the  following  format: 

I + + + + + 1 

Isteptr  Isub-ptr  |sub#  Inindex  ladimott  Iwotr  I 

I -+ + +- + + I 


where 

"steptr"  is  a storaqe  entry  pointer  of  the  data  name  that 
this  dictionary  entry  represents; 

"sub-ptr"  is  a pointer  to  the  subscript  list,  if  any, 
asoociated  with  that  data  name; 

"sub#"  is  the  dimensionality  of  the  data  name; 

"nindex"  is  the  next  dictionary  index  of  the  same  data 
name,  but  with  different  subscript; 

"adjmptr"  is  a pointer  to  a list  that  contains  the 
"precedence  matrix"  entries  for  this  dictionary  entry; 
and 


, '■JIutT  ftr-Tr  ‘ 


- 
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"wptr"  is  a nointet  to  the  list  of  structures  that  define 

( 

the  map  vector  associated  with  each  type-3  edae  that 
emanates  ftorr  this  node. 

Note  that  a data  name  can  have  mote  than  one  dictionary 
entry  in  the  following  situations: 

(])  The  data  name  is  a descendant  of  an  undate  file;  or 
(2)  The  lata  name  is  an  array. 

In  the  latter  case,  each  use  of  the  data  name  with 

I 

different  subscript  has  an  entry  in  the  dictionary. 

For  example,  if  MATRIX  is  an  array  of  sire  10x10,  and 
it  is  used  in  three  forms:  matr IX  f 1 , R)  , ’’ATRIX,  and 
MATRIV(5,*),  then  ? dictionary  entries  will  he  cieated  foi 
the  three  different  representations. 

6.2.1  Description  of  Subset  jots . 

A subscript  associated  with  a data  name  (pointed  by  t 

sub-otr  in  the  dictionary)  is  described  bv  the  following 
PL/I  structure: 

I 

I 


I 


I 


I 

) 

I 

I 

t 
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u«_u  1 VAR  LIST  BASFD ( VARLISTP)  , 

2 SRC  ?5R_TAR  CHAR(l), 

/*'§•  - IF  THF.  VAR  IS  ON  TUG  PH3  OF  ASSERTION 
OH  IN  THE  < I F C LAU5F > OF  THE  ASSERTION, 

•r*  - IF  ON  THE  LH3  OF  ASSERTION , */ 

2 ELSE_3IT  SIT ( 1 ) , /*THIS  3IT  IS  SET  IF  THE  VARIABLE 
IS  IN  THE  ELSE  CALUSE  OF  THE  ASSERTION*/ 

2 RED  FN  BIT(l),  /‘SET  IF  THIS  NAME  IS  ARGUMENT  OF 

A REDUCTION  FUNCTION*/ 

2 #OF I BIN  FIXED,  /‘NUMBER  OF  SUBSCRIPTS*/ 

2 LINDEXO  BIN  FIXED,  /‘DICTIONARY  INDEX*/ 

2 VAR  NAM  PTR  PTR , /‘POINTER  TO  VARIABLE  NAME*/ 

2 SUBSCRIPTS (N  REFER (VAR  LIST. *OFI ) ) , 

3 NAM  OP  INT  B IT ( 4 ) , /‘‘O'?  IF  INTEGER, 

•l'B  OTHERWISE.*/ 

3 FCN_BIT  R IT ( 1 ) , /*SET  IF  FUNCTION*/ 

3 CONST  BIT  B IT ( 1 ) , /‘SET  IF  THF  SUBSCRIPT 
IS  A CONSTANT*/ 

3 VAPSUB  PTR  PTR,  /‘POINTER  TO  SUBSCRIPT  NAME*/ 

3 LB  BIN  FIXED (31,0) , /* LOWER  BOUND*/ 

3 UB  BIN  FIXED  (31,0)  , /‘UPPER  BOUND*/ 

3 INC  BIN  FIXED (31,0) , /‘INCREMENT*/ 

3 OFF  BIN  FIXED (31,0) ; /*  OFFSET*/ 

/*I  = L1*  TO  UB  BY  INC  A(I+OFF)*/ 


6.2.2  Descriotion  of  Mao  Vector. 


The  oointei , WPTR,  in  each  dictionary  entry  ooints  to  \J 

the  following  structure,  which  desciihes  the  .uao  vectors 
associated  with  all  the  tvoe-8  successors  of  the  node. 


I 
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DCL  1 WEIGHTV_LIST  BASFD ( WVPTR ) , 

2 WDIM  BIN  FIXED,  /‘DIMENSION*/ 

2 #TYPE8  BIN  FIXED,  /*#  OF  TYPE  8 EDGES*/ 

2 W V (N  REFER ( ITYPE8) ) , /‘WEIGHT  VECTORS*/ 
3~X0  BIN  FIXED,  /*LOWER  BOUND*/ 

3 XM  BIN  FIXED,  /‘UPPER  BOUND*/ 

3 CX  BIN  FIXED,  /‘INCREMENT*/ 

3 YO  BIN  FIXED,  /‘LOWER  BOUND*/ 

3 YM  BIN  FIXED,  /‘UPPER  30UND*/ 

3 CY  BIN  FIXED;  /‘INCREMENT*/ 


The  siqnificance  of  the  different  entries  in  each  mao 
vector  is  described  in  chapter  8, 


6.2.3  Creating  and  Accessing  the  Dictionary.  , 

In  this  section,  the  different  routines  that  are  used 
to  create  and  access  the  dictionary  are  described  briefly. 

The  following  routines  are  used; 

(1) .  ACRDO 

(2) .  ACRD1 

(3) .  ACRDR 

(4) .  ACRDC 

(5) .  ACRDRA 

(6) .  ACRDRS  i*1 

(7) .  ACRDFP 

Even  though,  the  dictionary  was  described  in  the 
earlier  section  as  an  entry  for  each  data  name,  it  is 
actually  organized  in  a grouo,  each  group  containing  50 
entries  as  follows: 
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FIRST  PTR 


I 1 I next-ptrl  dictionary  entries  1 to  50  I 


I 2 | next-ptrl  dictionary  entries  51  to  1001 

| + + | 

• • • and  so  on* 


In  order  to  access  i th  dictionary  entry,  it  is 
necessary  to  search  thru  the  list  of  stacks  for  the  j th 
stack  and  then  access  the  k th  entry  in  that  stack;  where, 
j - 1 + i /50 

and  k « 1 + MOD(i-l,50) 

6. 2. 3.1  The  ACROQ  Procedure. 

The  ACRDO(S)  orocedute  creates  a dictionary  entry  for 
the  data  name  with  the  storage  entry  oointer  S.  It  also 
creates  a data  area  which  is  linked  to  data-list  of  the 
common  data  area  of  the  storage  entry  (see  section 
6. 1.4.1).  This  list  is  used  to  contain  Dointers  that  link 
the  storage  entry  with  its  parent  and  daughter  names. 
This  routine  is  usually  called  to  add  an  assertion  name 
(or  a data  name  in  a data  descriDtion  statement)  to  the 
dictionary. 


i 

t 

I ; 

i 
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6.2. 1.2  The  ACPD1  Procedure. 

The  ACt’Dl  ( S , 1 , p I ) procedure  aeates  a dictionary  entLy 
for  the  data  name  with  the  storage  entry  nointei  S.  Also, 

I soecifies  the  dimensionality  of  the  data  name,  and  if  it 
is  zeto,  then  the  use  of  the  data  name  did  not  have  any 
subsctiDt.  The  subsctiot  description  of  the  data  name  is 
qiven  by  the  pointer  to  a structure  VAR  LIST  (described 
later).  This  pointei  is  aiven  bv  the  exteinal  pointei 
VARLISTP.  The  index  of  the  dictionary  entrv  created  is 
returned  in  PI. 

The  function  of  this  routine  is  to  search  the 
dictionary  for  the  same  data  name  and  the  subscript.  If 
an  entrv  is  found,  then  no  entrv  is  created.  Otherwise, 
an  additional  entry  is  created  bv  oronerlv  1 inkino  this 
entry  to  the  Previous  entry  of  the  same  name. 

6. 2. 3. 3 The  ACRDR  Procedure  (Peturn  Storage  Potty  Ptr). 

The  ACROR(I,PTR)  procedure  returns  the  storaoe  entry 

pointer  of  the  Ith  dictionary  entry  in  the  second 
argument,  PTR.  The  pointer  to  subscript  information 
associated  with  that  dictionary  entry  is  returned  in  the 
pointer  VARLI3TP  (which  is  an  external  pointer). 

6.2. 3.4  Vhe  ACPDC  Procedure  (Duplicate  Di^ct.  Entry). 

The  ACRDC(PTR)  Procedure  creates  a duplicate  dictionary 
entry  for  the  storage  entry  oiven  by  ptr.  This  routine  is 
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used  to  create  duolicate  entries  for  the  descendants  of  an 

\ 

update  file. 

6. 2. 3. 5 The  ACPDRA  Procedure  (Return  PM  Entry  List  Ptr . ) . 

r 

The  ACPDRA ( I , PTR)  procedure  returns  in  the  second 
arqument  the  pointer  to  the  precedence  matrix  list, 
adjmptr,  for  the  Ith  dictionary  entry. 

6.2. 3.6  The  ACRDRS  Procedure  (Set  PM  Entt^  List  Ptr . ) . 

The  routine  ACRDRS ( I , PTR)  is  used  to  set  the  Ith 

adjmptr  with  the  pointer  given  in  the  second  argument, 

PTP . 

6. 2. 3. 7 The  ACRDFP  Procedure  (Return  First  Stack  Pointer). 

The  function  ACROFP  returns  the  oointer  to  the  first 

i 

stack  of  the  dictionary  entries  (FIRST  PTR  in  the  fiqure 
described  in  the  beginning  of  section  6.2.4). 

a 

r 
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•j.3  The  Precedence  Matrix. 

The  Precedence  -iatrix  (Weighted  Vliacency  Matrix)  is  a 
non-oictor ial  representation  of  the  directed  oraoh  that 
represents  the  precedence  relationships  between  the  data 
names  and  the  assertion  names  in  the  MODEL  soecif ication. 
It  is  a matrix  of  size  nxn,  where,  n is  the  number  of 
dictionary  entties  used  in  the  MODEL  specification.  If 
any  element  (i,j)  of  the  precedence  matrix  is  nonzero  then 
the  j th  dictionary  entry  is  said  to  be  the  successor  of 
the  i th  dictionary  entry.  The  (i,i)  th  element  in  the 
matrix  represents  the  tvoe  of  relation  between  the  two 
dictionary  entties.  The  allowed  types  of  relations  are 
described  in  section  4.12.  This  section  describes  the 
orqanization  of  the  precedence  matrix. 

To  conserve  soace  for  savina  the  nxn  matrix  and  to 
minimize  time  reouited  to  search  am  enttv  in  the 
precedence  matrix,  the  matrix  is  not  represented  in  the 
form  of  an  array,  but  with  n different  lists.  The  i th 
list  contains  a list  of  all  the  entries  in  the  i th  row 
and  the  i th  column  of  the  precedence  matrix.  The  address 
of  this  list  can  be  obtained  from  the  i th  dictionary 
entiy  (the  toutine  can  be  used  foi  this  ouroose. 
See  section  6.2.4).  The  format  of  the  precedence  matrix 
entiv  list  con esoondinq  to  the  k th  dictionary  entry  is 
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qiven  in  the  following: 

I + + ++ + ++ + ++ +| 

I alloc  I l used | cused | I col ( 1 ) I c value | | cow ( 1) ! l value  I | ...  I I 

I I I II  1(1)11  1(1)11  || 

| + + ++ -+ ++- +- ++ + 1 

1+ ++ + ++ + ++ + 1 

II  . . . I I col (i ) Icvalue I I low ( i ) I rvalue  I I ...II 

II  II  I (i)  I I 1(1)11  II 

1+ ++ + ++ + ++ +1 

where 

"alloc"  is  the  maximum  number  of  column  or  row  entries  in 
this  list  (that  is,  alloc  >-  MAX (r used, cused) ) . 

"fused"  is  the  number  of  entries  in  the  k th  row. 

"cused"  is  the  number  of  entries  in  the  k th  column. 
"col(i)"  and  "cvalue(i)"  (i  = 1 to  rused)  is  the  column 
number  and  the  cor resoondino  matrix  entry  (in  k th  row 
and  col(i)  th  column)  respectively. 

"row(i)"  and  "rvalue(i)"  (i  = 1 ro  cused)  is  the  tow 

number  and  the  cor respondinq  matrix  entry  (in  row(i) 
th  row  and  k th  column)  respectively. 

As  an  example,  if  the  entries  (5,100),  (5,8)  and  (10,5) 
of  the  precedence  matrix  are  3,  4,  and  1 respectively, 

ther  the  5th  precedence  matrix  entry  list  will  look  as 


follows 
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I — +—+—++ — + — ++ — + — +t — + — ++ — + — +1 
I I I II  I II  I II  I II  I II 
I 2 | 2 | 1 | | 1001  3 I I 101  1 I | 8 I 4 | | - | - | | 

I — + — + — H — +— H — + — H — + — H — + — +1 


The  8 th  precedence  matrix  entry  will  look  as  follows: 


— +- 

+— 

-++- 

— +- 

— ++- 

— H- 

— +- 

— H- 

— +- 

— +1 

1 

1 

1 1 

1 

1 1 

1 

1 1 

1 

1 1 

1 

1 1 

2 I 

0 I 1 

1 1 

- I 

- 1 1 

5 I 

4 11 

- 1 

- 1 1 

- 1 

- 1 1 

+— . 

-++- 

— +- 

— H- 

— I- 

-If- 

— +- 

— ++- 

— +1 

The  10th  precedence  matrix  entry  will  look  as  follows: 


— +- 

+ 

-++- 

— ++— 

-++- 

-++— 

-+l 

1 

1 

1 1 

1 

1 1 

1 

1 1 

1 

1 1 

1 

1 1 

2 I 

1 1 0 

1 1 

5 I 

1 II  - 

1 - 

1 1 

- 1 - 

1 1 - 

1 - 

1 1 

— ■ +- 

— + — 

-++- 

— I- 

—++— 

-++- 

-H— 

-+l 

And  finally,  the  100  th  precedence  matrix  entry  will  look 
as  follows: 


I + + H + H H + +1 

I I I II  I II  I II  I II  I II 

I2I0I1II-I-II5I3II-I-II-I-M 
I — + — + — H — + — H — + — H — + — ++ — + — +1 


6.3.1  Creating  and  Accessing  the  Precedence  Matrix. 


This  section  describes  the  routines  used  to  create  and 
access  the  precedence  matrix.  The  routines  used  are: 

(1)  ADJMRS 

(2)  ADJMCS 

(3)  ADJMSET 
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(4)  ADJM 

6. 3. 1.1  The  ADJMRS  Procedure  (Set  a Row  of  PM) . 

The  ADJMRS ( I, V)  procedure  updates  the  precedence 
matrix  entry  list  of  the  Ith  dictionary  entry  with  the 
value  specified  by  the  second  argument,  V.  V is  a matrix 
of  size  2xN.  The  first  row  of  V specifies  the  column 
numbers  and  the  second  row  specifies  the  values  that  the 
corresponding  columns  in  the  first  row  must  have.  For 
example,  if  V is  given  by  the  following  matrix: 

10  5 2 
3 6 1 

then  the  following  call: 

CALL  ADJMRS ( 12, V) 

will  set  the  values  3,  6,  and  1 to  the  locations  (12,10), 
(12,5)  and  (12,2)  respectively. 

6. 3. 1.2  The  ADJMCS  Procedure  (Set  a Column  of  PM). 

The  ADJMCS (I, V)  PROCEDURE  is  similar  in  function  to 
the  procedure  ADJMRS.  In  this  case  I represents  the 
column  number  (I  represented  the  row  number  in  the  case  of 
ADJMRS).  In  other  words,  ADJMCS  is  used  to  set  one  or 
more  entries  of  the  Ith  column  of  the  precedence  matrix. 
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6. 3. 1.3  The  ADJMSET  Procedure  (Set  an  Entry  of  PM) . 

The  ADJMSET ( I ,J , K)  procedure  is  used  to  set  the  value 
K to  the  entry  (I,J)  of  the  Precedence  matrix. 

6. 3. 1.4  The  ADJM  Procedure  (Return  an  entry  of  PM) . 

The  Function  ADJM(I,J)  returns  the  entry  (value)  in  the 
Ith  row  and  the  Jth  column  of  the  precedence  matrix.  If 
the  entry  (I,J)  is  undefined,  then  'O'  is  returned.  Also, 

If  I = 0,  then  it  returns  the  number  of  nonzero  entries  in 
the  Jth  column,  and  if  J * 0,  then  it  returns  the  number 
of  nonzero  entries  in  the  Ith  row. 

6.4  The  Derivation  Tree  of  an  Assertion. 

For  ease  of  analyzing  an  assertion,  the  derivation  tree 

•» 

(DT)  of  each  assertion  is  saved  in  the  data  area  of  the  > 

assertion.  The  DT  is  a list  of  entries  that  represent 
each  production  in  the  EBNF  for  the  assertion.  The  root  t 

of  the  DT  is  saved  in  ASS_PTR  of  the  extended  data  area 

(see  section  6. 1.4. 3).  The  different  entries  used  in  the 

- ■ 

DT  of  an  assertion  is  given  in  the  following  sections. 

Some  examples  of  DTs  are  presented  in  the  section  6.4.10. 
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6.4.1  The  Assertion  Entry. 

The  pointer  ASS_PTR,  in  the  extended  data  area  of  an 
assertion  points  to  the  root  of  the  dirivation  tree  of  the 
assertion.  This  root  entry  has  the  following  format: 


I + +- + + | 

I if-ptr  las-ptr(l)  I sas-ptr (1 ) I as-ptr (2)  |sas-ptr(2)| 
| + + + + I 


where 

"if-ptr"  is  a pointer  to  the  " if-clause"  entry.  If  no 
"if-clause"  exists  then  the  pointers:  if-ptr, 

as-ptr (1),  as-ptr  (2),  and  sas-ptr (2)  are  all  null, 
"as-ptr (1)"  is  pointer  to  the  next  assertion  entry 
(similar  to  the  above  entry) , if  the  assertion 
contains  nested  "if-clause"s. 

"sas-ptr (1)"  is  a pointer  to  the  "simple  assertion"  entry, 
"as-ptr (2)"  is  pointer  to  the  next  "assertion"  entry 
(similar  to  the  above  entry)  if  the  assertion  contains 
nested  "if-clause"s  in  the  "else"  part  of  the 
assertion. 

"sas-ptr (2)"  is  a pointer  to  the  "simple-assertion"  entry 
in  the  "else"  part  of  the  assertion. 

For  example,  if  the  assertion  is  given  by: 


i 

i 


( 


■y 


IF  A«B  THEN  IF  A»C  THEN  X«Y  ELSE  X*Z 


■ 


then  if-ptr  points  to  the  DT  of  the  boolean  expression 
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"A=B",  as-ptr(l)  points  to  the  DT  that  represents  the 
assertion  "IF  A=C  THEN  X=Y  ELSE  X=Z",  and  the  pointers 
sas-ptr(l),  as-ptr(2)  and  sas-ptr(2)  are  all  null. 

As  another  example,  for  the  assertion: 

IF  A=C  THEN  X=Y  ELSE  X=Z 

if-ptr  points  to  the  DT  of  the  boolean  expression  "A=C", 
sas-ptr(l)  points  to  the  DT  of  the  simple  assertion  "X=Y", 
and  sas-ptr(2)  points  to  the  DT  of  the  simple  assertion 
"X*Z".  The  pointers  as-ptr(l)  and  as-ptr(2)  are  both 
null . 

Some  more  examples  of  the  derivation  trees  are  given  in 
the  section  6.4.10. 

6.4.2  The  Simple  Assertion  Entry. 

The  simple  assertion  entry  has  the  following  format: 

I — + I 

I lhs-ptr  I rhs-ptr  I 
I- + --I 

i 

where 

"lhs-ptr"  is  a pointer  to  the  DT  of  the  arithmetic 
expression  in  the  LHS  of  the  assertion, 
and  "rhs-ptr"  is  a pointer  to  the  DT  of  the  arithmetic 
expression  in  the  RHS  of  the  assertion. 
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6.4.3  The  Aiithmetic  Expression  Entry. 

The  arithmetic  expression  entry  has  the  following 
format : 


| + +- | 

I code  I opr  Iterms-ptr  I 
| + +- — I 


where 

"code"  indicates  the  type  of  DT  pointed  by  terms-ptr, 

"opr"  is  a monadic  operator,  if  any, 

and  "terms-ptr"  points  to  one  of  the  9 entries  depending 
on  the  value  of  code.  The  different  types  of  DTs  that 
terms-ptr  points  to,  is  given  in  the  following  table. 

TABLE  6.4 


code 

1 DT  pointed  by  terms-ptr 

A 

T 

lArithmetic  expression 

R 

(Expression  using  relational  operators 

1 

(Expressions  using  * 1 ' 

& 

(Expressions  using  ' 

C 

(Expressions  using  'll' 

S 

(Expressions  using  '+'  or 

T 

(Expressions  using  '**  or  '/' 

F 

(Expressions  using  '**' 

I 

1 Integer 

M 

1 Misc-data 

V 

(Variable  name 

6.4.4  The  Diadic  Expression  Entry. 


183 


The  expressions  using  the  diadic  operators  ’I', 

'll’,  '/'#  '**'  and  the  relational 

operators  have  the  same  format  and  is  given  by: 

| + + + + I 

lleft-ptr  llcodelopr  Ircode  Iright-ptr  I 

I + +~— + + 1 


where 


"left-ptr"  points  to  one  of  the  9 entries  depending  on  the 
value  of  code  (as  in  the  case  of  arithmetic  expression 
entry)  , 


"lcode"  indicates 

the 

type  of 

pointer 

represented 

by 

left-ptr  (has 

the 

same  meaning  as 

the  code 

of 

arithmetic  expression 

entry)  , 

"rcode"  indicates 

the 

type  of 

pointer 

represented 

by 

"right-ptr"  (has  the  same  meaning  as  the  code  of 
arithmetic  expression  entry) , 

"opr"  represents  one  of  the  diadic  operators:  I,  &,  II,  +, 

" t * , /,  **,  ^ ^ ^ = f )*»  ~ I =, 

"right-ptr"  is  a pointer  to  RHS  of  the  diadic  expression. 
If  tcode  ■ 'X'  then  right-ptr  points  to  an  extended 
area  which  is  used  to  save  the  upper  bound  and  increment 
associated  with  the  for  expression.  This  extended  area 
has  the  following  format: 


I 

\ 

t 

t 

f 

r 

r 


\ 


* 
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| + + + 

Ixcode  I rptr  Itoptr  Ibyptr 

| + + + 


where 

"xcode"  indicates  the  type  of  pointer  represented  by  rptr, 
"rptr"  is  a pointer  to  the  RHS  of  the  diadic  expression, 
"toptr"  is  a pointer  to  the  arithmetic  expression  that 
specifies  the  upper  bound,  and 
"byptr"  is  a pointer  to  the  arithmetic  expression  that 
specifies  the  increment  associated  with  the  subscript 
name. 

6.4.5  The  Integer  Entry. 

The  entry  for  "integer"  represents  an  inteqer  constant 
and  has  the  following  format: 


lvalue  I 

| 1 

where  "value"  is  the  integer  constant  itself. 

6.4.6  The  Misc-data  Entry. 

The  entry  for  "Misc-data"  represents  bit-strings, 
character  strings  and  floating  point  data  constants  and 
has  the  following  format: 
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| + 4. | 

ItyDe  | length Idata  I 

I — + + | 


where 

"type"  specifies  the  type  of  data  represented  by  data, 
"length"  is  the  length  of  the  data  that  follows, 
and  "data"  is  BIT  (if  type  * 'B')  01  character 

representation  (if  type  = 'C'),  ol  floating  ooint 

representation  (if  type  = ’F’)  of  the  data. 

6.4.7  The  Entry  for  Variable  Names . 

The  entry  for  variable  names  has  the  following  format: 

I + I 

I length  I variable  name  I 

I— +- 1 

where 

"length"  is  the  length  of  the  variable  name  that  follows, 
and  "variable  name"  is  the  name  of  the  variable  itself. 

6.4.8  The  Variable  Entry. 

The  variable  entry  represents  a subscripted  variable 
name  and  has  the  following  format: 


— 
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| + + + + I 

I nam-ptr  In  I ae(l)  I . . . I ae(n)  I 

| + + + + I 

where 

"nam-ptr"  is  a pointer  to  the  variable  name  entry  (see 
section  6.4.7) , 

"n"  is  the  number  of  subscripts  associated  with  the 
variable  name, 

and  "ae(i)"  (i  = 1 to  n)  is  a pointer  to  the  arithmetic 
expression  entry  that  represents  the  i th  subscript. 

6.4.9  The  Boolean  Expression  Entry. 

The  boolean  expression  entry  has  the  same  format  as  the 
arithmetic  expression  entry  described  in  section  6.4.3. 
The  LHS  or  RHS  of  diadic  expressions  that  use  ' I ' or 
as  operators,  specify  individual  for  expression  or  boolean 
expression.  In  the  case  of  for-expressions,  the  diadic 
expression  entry  can  have  extended  information  (rcpde  = 
'X')  as  explained  in  section  6.4.4. 

6.4.10  Some  Examples  of  Derivation  Trees . 

In  the  following,  the  derivation  trees  for  some 
assertions  are  qiven.  The  assertions  used  are: 

EXAMPLE1:  A * B + C 


EXAMPLE2 : 


IF  A-B  THEN  OD(I)  ELSE  C - E 
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EXAMPLE 3 : FOR  I = 1 TO  50  BY  2 A(I*10+2)  =B(I) 

The  derivation  trees  for  the  above  examples  are  given 
in  Figures  6.2,  6.3  and  6.4  respectively. 


\ 


-y 


uff 


191 


6.5  The  Precedence  Relations. 

In  this  section,  the  allowed  precedence  relations 
between  the  data  names  and  the  assertion  names  (the 
different  entries  of  the  precedence  matrix)  are  described: 
Essentially,  there  are  three  groups  of  relationships. 

(1) .  Hierarchical  Relationship  - This  is  a relationship 

between  a data  name  and  its  descendant  (or  its 
parent) . 

(2) .  Value  Dependency  Relationship  - This  is  a 

relationship  between  a data  name  and  an  assertion 
name. 

(3) .  Pointing  Relationship  - This  is  a relationship 

between  an  exist,  length  or  pointer  name  and  a data 
name. 

The  entries  in  the  precedence  matrix  can  contain  the  9 
types  of  relationships , which  fall  into  one  of  the  above 
categories.  These  types  of  relations  are  described  in  the 
following: 

Typel:  This  type  is  a hierarchical  relationship  between 
the  source  data  names  and  its  descendants. 

Type2:  This  type  is  also  a hierarchical  relationship  and 
is  between  the  target  data  names  and  their  parent. 
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Type3:  This  type  is  a value  dependency  relationship 
between  the  source  variables  of  an  assertion  and 
the  assertion  itself. 

Type4:  This  type  is  also  a value  dependency  relationship, 
and  is  between  an  assertion  and  target  variables 
used  in  that  assertion. 

Type5:  This  type  is  a pointing  relationship  between  a 
"length  name",  L,  and  a data  name  the  length  of 
which  is  given  by  L. 

Type6:  This  type  is  also  a pointing  relationship,  and  is 
between  an  "exist  name",  E,  and  a data  name,  the 
size  of  which  depends  on  E. 

Type7:  This  type  is  also  a pointing  relationship  and  is 
between  a "pointer  name",  P,  and  a repeating  data 
name,  the  selection  of  a particular  member  of  which 
is  determined  by  P. 

Type8:  This  type  is  a hierarchical  relationship  between 
entries  for  different  subscripted  representations 
for  an  interim  or  target  name. 

Type9:  This  type  is  a hierarchical  relationship  that 
specify  adjacency  of  two  data  names  in  a media. 

As  an  example,  for  the  foilwing  set  of  MODEL 

statements : 


I 
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N IS  INTERIM 
AO:  N = 10 
Al:  A.B  = C 
A IS  GROUP 

B IS  FIELD (A, CHAR (N) ) 

C IS  FIELD (A, CHAR  (5) ) 

the  precedence  matrix  will  contain  the  following  entries: 


1 

A 

B 

C 

N 

AO 

Al 

A 

1 

0 

0 

0 

0 

0 

0 

B 

1 

2 

0 

0 

0 

0 

0 

C 

1 

2 

0 

0 

0 

0 

3 

N 

1 

0 

5 

0 

0 

0 

0 

AO 

1 

0 

0 

0 

4 

0 

0 

Al 

1 

0 

4 

0 

0 

0 

0 

where  it  is  assumed  that  A and  B are  target  variables  and 
C is  a source  variable. 

6.6  The  Directed  Graph. 

A directed  graph  is  a graphical  representation  of  the 
precedence  matrix.  Each  node  of  the  directed  graph 
represents  either  a data  name  or  an  assertion  name.  Thus, 
the  number  of  nodes  in  the  directed  graph  (for  a model 
specification)  is  equal  to  the  number  of  dictionary 

I * & 

entries.  An  arc  is  drawn  between  the  two  nodes  that 
represent  I th  and  J th  dictionary  entries,  if  the  J th 
dictionary  entry  is  a successor  of  the  I th  dictionary 
entry.  The  label  on  each  arc  represents  the  type  of 
relationship  between  the  two  nodes. 


As  an  example,  the  directed  graph  for  the  simple 
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example  given  in  the  previous  section  is  shown  in  Figure 
6.5.  In  this  graph,  circles  represent  variable  nodes,  and 
the  squares  represent  assertion  nodes. 

In  general,  a node  of  a directed  graph  can  represent  an 
array  of  elements  which  will  be  represented  by  the 
following  notation: 

A(l,u,c) 

Where  A is  a varaible  name,  and  1,  u and  c define  the 
range  of  elements  represented  by  the  node.  The  above 
notation  represents  the  following  set  of  elements  of 
variable  name: 

A ( 1 ) , A ( 1+c ) , A(l+2.c) , ...,  A (1+k .c) 
where  k = (u-l)/c.  Because  a node  can  represent  an  array 
of  elements,  we  will  also  call  our  directed  graph  as  an 
array  graph.  A Formal  definition  of  array  graph  is  given 
in  chapter  9. 


1 


■k 

i 


Figure  6.5.  Directed  Graph  for  the  MODEL  statements 

N IS  INTERIM; 

AO;  N - 10; 

Al:  A.B  - C; 

A IS  GROUP; 

B IS  FIELD(A, CHAR(N) ) ; 

C IS  FIELD(A, CHAR(5)) ; 
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The  directory  consists  of  an  entry  for  each  member  in 
the  library,  which  consists  of: 

(1)  name  of  the  member, 

(2)  the  line  number  of  the  first  line  of  data  for  this 
member,  and, 

(3)  the  line  number  of  the  last  line  of  data  for  this 
member. 

The  data  area  consists  of  input  statements  of  all  the 
members  in  the  library. 

When  a member  is  added  to  the  library,  the  statements 
for  that  member  are  padded  to  the  end  of  data  area;  and  an 
entry  for  the  memebr  name  is  added  to  the  directory.  The 
summary  record  is  also  updated. 


When  a 

member 

is  deleted 

from 

the  library. 

the 

statements 

of 

that 

member  are 

not 

deleted,  but 

the 

directory 

entry 

for 

that  member 

is  changed  to  a 

dummy 

entry. 

Also, 

the 

summary  record 

is  updated 

by 

incrementing  the 

number  of  members 

deleted. 

The  library 

maintenance  subsystem 

consists  of 

two 

subroutines:  (1)  LI BCR  which  is  used  to  create  a new  MODEL 
library,  and  (2)  LIBU,  which  is  used  to  update  the  MODEL 
library.  In  the  following,  these  routines  are  described 
in  more  detail. 
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7.2  The  LIBCR  Procedure.  i 

The  LIBCR (#DIR)  is  the  procedure  to  create  a MODEL 
library.  The  parameter,  #DIR,  specifies  the  number  of 
directory  blocks  to  be  used  in  the  library.  The  functions 
of  this  procedure  are: 

(1)  to  write  the  summary  record, 
and  (2)  to  write  dummy  directory  blocks. 

If  no  parameter  is  specified,  then  a default  of  two  blocks 
is  used  for  the  directory.  The  number  of  directory 
entries  in  a block  can  be  obtained  using  the  following 
formula : 

# of  directory  entries  per  block  = 

(block  size  -2)/27. 

Block  size  is  usually  the  track  length  of  the  medium  in 

which  the  library  resides.  As  an  example,  if  block  size  , 

is  2048,  then  the  above  formula  yields  75  directory 
entries  in  a directory  block.  The  library  is  organized  as 

'I 

a Regional (1)  data  set.  \ 

7.3  The  LIBU  Procedure. 

The  LIBU  procedure  is  used  to  update  the  MODEL  library. 

It  can  perform  the  following  functions: 

(1) .  Add  a member  to  the  library. 

* 

(2) .  Delete  a member  from  the  library. 


I 
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(3).  Dump  the  contents  of  the  library, 
and  (4).  Compress  the  library  and  create  a new  one  after 
eliminating  all  the  holes  (created  by  the 
deletion  of  members)  in  the  library. 

Figure  7.1  shows  a functional  overview  of  the  LIBU 
procedure.  It  obtains  input  primarily  from  SYSIN  file. 

The  sysin  file  consists  of  two  types  of  statements:  (1) 

data  statements  and  (2)  control  statements.  A statement  \ 

in  the  SYSIN  file  is  a control  statement  if  it  is  one  of 

the  following  types:  j 

(1) .  ADD  member-name  ] 

(2) .  ^DELETE  member -name  j 

(3) .  jrfDUMP 

(4) .  0COMPRESS 

(5) .  jrfSEC  section-name 


.i. 
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The  first  four  types  of  control  statements  perform  the 
four  functions  mentioned  in  the  beginning  of  the  section. 

The  last  control  statement  is  used  to  divide  the  input 
statements  into  sections.  This  division  allows  the  REFER 
statements  of  MODEL  to  include  a subset  of  statements  of 
the  member  (a  section).  All  other  statements  are  assumed 
to  be  the  input  statements.  Data  statements  are 
associated  with  only  the  ADD  control  statement. 

Therefore,  if  an  ADD  statement  is  followed  by  a set  of 
data  statements  (and  SEC  control  statements) , then  those 
set  of  statements  are  saved  in  the  library  with  the  member 
name  specified  in  the  ADD  statement. 

When  the  SYSIN  file  contains  a COMPRESS  statement,  each 
member  in  the  MODEL  library  is  copied  to  a new  library  as 
shown  in  Figure  7.1.  The  new  library  must  have  been 
previously  created  by  the  LIBCR  procedure  described  in 
section  7.2.  The  reason  for  creating  a new  library  rather 
than  updating  the  old  one  itself  is  to  guard  against  any  • 

system  failure  during  the  compress  operation,  which  could 
clutter  the  library.  Therfore,  to  compress  a library,  the 
following  steps  must  be  followed: 

(1) .  Create  a new  library  using  the  procedure  LIBCR. 

(2) .  Compress  the  old  library  and  update  the  new 

library  using  the  procedure  LIBU. 
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(3) .  If  the  compress  operation  was  successful  in  step 

(2) , scratch  the  old  library,  and  rename  the  new 
library  name  appropriately. 

(4) .  If  compress  was  not  successful  for  some  reason, 

repeat  steps  (1)  thru  (4)  again. 

ADD  ASS1 

ASS1:  A = B + C; 

ADD  INT 
INT  IS  GROUP; 

A IS  FIELD ( INT) ; 

B IS  FIELD ( INT) ; 

DELETE  OLDMEM 
ADD  NEWA 
NEWA:  A = X * Y; 

Figure  7.2.  A Sample  Input  for  LIBU  Procedure. 

Figure  7.2  shows  a sample  input  for  the  LIBU  procedure. 
In  this  example,  the  procedure  LIBU  will  add  3 members, 
ASS 1 , INT,  and  NEWA  to  the  library  and  deletes  the  member 
OLDMEM  from  the  library.  Note  that  all  the  control 
statements  start  at  column  2. 

The  routines  LIBCR  and  LIBU  are  not  part  of  MODEL 
system.  They  are  separate  stand  alone  procedures. 
However,  the  MODEL  library  can  be  accessed  by  the  MODEL 
system  during  the  syntax  analysis  phase  (while  processing 
REFER  statements)  and  updated  during  the  documentation 
phase,  when  the  complete  MODEL  specification  is  saved  as  a 
member  in  the  library. 
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CHAPTER  8 

I 

DETERMINATION  OF  PRECEDENCE  RELATIONSHIPS 

t 

Once  the  syntax  analysis  is  complete,  the  statements 
stored  in  the  associative  memory  are  analyzed  in  Phase-2 
of  the  MODEL  Processor.  This  analysis  produces  (1)  a 
Dictionary  of  all  names  used  in  the  MODEL  specification 
and  (2)  a Precedence  Matrix  that  represents  the  precedence 
relationships  between  different  dictionary  entries. 

The  dictionary  is  similar  to  the  directory  of  the 
associative  memory,  but  contains  mote  information  than  the 
directory.  There  can  be  more  than  one  dictionary  entry 
for  a data  name  represented  by  a directory  entry.  There 
exists  one  dictionary  entry  for  each  node  in  the  directed 
graph  that  represents  the  relationships  between  different 
data  items  used  in  the  specification. 

The  organization  of  dictionary  and  the  Precedence 
Matrix  was  explained  in  chapter  6.  Figure  8.1  gives  an 
over?:  -w  of  the  Phase-2  of  the  MODEL  Processor.  It  is 

r J \ 

shown  as  consisting  of  two  parts: 

(1).  Generation  of  dictionary, 
and  (2).  Generation  of  Precedence  Matrix. 
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GENERATED 
STATEMENTS 
AND  ERRORS 


GENERATE  DD  STATEMENTS  FOR  ASSOCIATIVE 

NAMES  PREFIXED  BY  '$P',  ’IQ’H-ej  MEMORY 
•#E'  AND  'll,'  (GEN$P#Q)  \ j 


CREATE  A DICTIONARY  ENTRY 


Figure  8.2.  Overview  of  generating  the  Dictionary. 
(Procedure  ACRDICT) . 
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Figure  8.3.  Overview  of  generating  Precedence 
Matrix  (Procedure  ACRADJM). 
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Figure  8.2  shows  the  different  steps  involved  in 
creating  the  dictionary.  And,  Figure  8.3  shows  the 
different  steps  involved  in  creating  the  Precedence 
Matrix.  The  functions  of  different  steps  shown  in  Figures 
8.2  and  8.3  are  described  in  the  following  sections. 

8.1  Generating  the  Dictionary  (Procedure  ACRDICT) . 

The  first  step  during  the  network  analysis  is  to 
generate  a dictionary  of  all  names  supplied  by  the  user. 
The  names  include  data  names  and  the  assertion  names. 
Algorithm  A8.1  describes  the  routine  ACRDICT  which  is  used 
to  create  the  dictionary  from  the  MODEL  specification.  It 
uses  the  contents  of  the  associative  memory  created  during 
the  syntax  analysis  phase.  The  process  of  generating  the 
dictionary  consists  of  the  following  steps  as  shown  in 
Figure  8.2: 

(1) .  Generating  dd  statements  for  the  parent  names  for 

which  no  dd  statements  were  specified. 

(2) .  Generating  data  description  statements  for  the 

successor  of  group  and  record  names  for  which  no 
successor  exists. 

(3) .  Creating  a dictionary  entry  for  each  data  name  and 

assertion  name  in  the  directory. 

(4) .  Creating  a dictionary  entry  for  each  data  name  used 

in  the  assertion,  after  generating  an  appropriate  dd 

X 
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statement,  if  necessary. 

(5).  Analyzing  each  subscript  statement  and  associating  a 
parent  name  for  that  name. 

(8).  Resolving  each  data  name  as  source  or  target. 

(7).  Generating  duolicate  dictionary  entries  for  update 
data  names  (names  that  are  descendents  of  an  update 
file) . 

Also,  some  pointers  in  the  data  area  of  each  statement 
are  updated.  These  pointers  establish  links  between  the 
data  name,  its  parent,  and  its  daughters.  The  dictionary 
creation  starts  first  with  media  names,  then  file  names, 
record  names,  group  names,  field  names,  interim  names, 
subscript  names  and  finally,  the  assertions.  This  type  of 
ordering  allows  easy  linking  of  data  names  with  their 
parent. 

During  the  analysis  of  an  assertion  (in  steo  4 of  A8.1) 
some  additional  assertions  could  be  generated  (for 
example,  while  simplifying  the  subscripts  to  the  "standard 
form").  The  storage  entry  pointers  of  these  generated 
assertions  are  saved  in  a stack.  If  the  stack  is  not 
empty,  those  generated  assertions  are  also  analyzed  and 
the  source  and  target  variables  used  in  them  are  added  to 
the  dictionary  (step  7 in  A8.1). 

The  routines  GEN$P#Q,  ACRDUP,  ACRPTR,  ACRSTVS,  ASUBST, 


d 
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RSORT  and  DUPLUDE  used  in  alqorithm  AS.l  are  described  in 
greater  detail  in  the  sections  that  follow. 


Algorithm:  A8.1,  ACRDICT . 

Function:  Creates  a dictionary  of  all  names  used  in  the 
MODEL  specification. 

Calls:  GEN$P#Q,  ACRSTVS,  ACRPTR , ACRDUP,  ASUBST,  RSORT, 
DUPLUDE. 

Called  by:  MONITOR. 

Stepl:  Using  the  routine  GEN$P#Q,  generate  dd  statements 

for  all  those  names  that  are  prefixed  with  '$P', 
'#Q',  ' # L ' , or  ' #E  ' , if  no  dd  statement  exists  for 
those  names. 

Step2:  Using  the  routine  RETREVE,  get  all  the  storage 

entries  for  the  data  names  and  the  assertion  names 
(that  is,  media,  file,  tecord,  group,  field, 
interim,  and  assertion  names,  in  that  order).  For 
each  storage  entry  thus  obtained,  perform  steps  3 
thru  6. 

Step3:  Using  the  routine  ACRDUP,  add  the  data  name  to  the 

dictionary. 

Step4:  If  the  data  name  is  an  assertion  name,  then,  usinq 

the  routine  ACRSTVS,  analyze  the  assertion  and  get 
a list  of  all  source  and  target  variables  used  in 
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the  assertion?  and  also,  add  each  of  the  name  in 

f 

the  list  to  the  dictionary. 

Step5:  If  the  data  name  is  not  an  assertion  name,  then, 

r 

using  the  routine  ACRPTR,  obtain  all  the  length, 
exist  and  pointer  names  associated  with  the  data 
name  and  add  them  to  the  dictionary. 

Step6:  Continue. 

Step"’:  If  any  assertion  is  generated  during  the  analysis 

of  the  assertion  in  step  4,  then  the  storage  entry 
pointers  of  those  generated  assertions  are  saved 
in  a stack.  If  the  stack  is  empty  then  return; 
otherwise,  perform  step  4,  for  each  storage  entry 
in  the  stack. 

Step8:  Now,  using  the  routine  RETREVE,  get  the  storage 

entries  of  all  subscript  statements.  Analyze  each 

obtained  entry  using  the  routine  A3UBST.  f 

Step9:  Using  the  routine  RSORT , resolve  each  data  name  as 

j 

source,  target,  or  interim. 

SteplO:  Using  the  routine  DUPLUDE,  create  duplicate 
dictionary  entries  for  "update"  data  names  (data 
names  that  are  both  source  and  tarqet). 

Stepll:  Return. 


211 


I 

i 


I 

l 


8.1.1  The  GEN$P#Q  Procedure. 

If  no  dd  statement  exists  for  a patent  name  of  a data 
name,  then  the  MODEL  specification  is  incomplete.  In  such 
cases,  an  appropriate  dd  statement  is  generated  for  the 
parent  name.  For  the  parent  of  a file,  a media  statement 
is  generated;  in  all  other  cases,  a group  statement  is 
generated.  As  an  example,  if  the  following  statements 
were  specified: 

A IS  FIELD (B) 

B IS  GROUP (C) 

and  no  dd  statement  for  the  data  name  C was  specified, 
then  the  following  group  statement  is  generated: 

C IS  GROUP 

Note  that  this  statement  does  not  have  any  parent,  this 
incompleteness  is  resolved  later  (see  the  description  of 
the  routine  RESIC). 

The  routine  GEN$P#Q  does  the  above  function  of 
generating  data  description  statements  for  the  parent 
names.  The  algorithm  A8.2  describes  the  functions  of  the 
routine  GEN$P#Q. 

The  routine  GENSTMT,  used  in  step  4 of  the  algorithm  is 
described  in  the  following. 
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Algorithm  A8.2:  GEN$P#Q. 

Function:  Generates  dd  statements  for  parent  names  of  data 
names  for  which  no  dd  statements  were  specified. 

Calls:  RETRPRX , RETRNAM , GENSTMT  and  RETREVE 

Called  by:  ACRDICT. 

Stepl:  Using  the  routine  RETRPRX,  get  all  the  names  in 

the  directory  that  are  prefixed  by  either  ’$P*  or 
'#Q ' . Save  all  such  names  in  NAM_LIST. 

Step2:  Perform  steps  3 thru  5 for  I = 1 to  # of  names  in 

NAM_LIST. 

Step3:  Using  the  routine  RETRNAM,  check  if  the  Ith  name 

in  NAMELIST  has  a dd  statement.  If  there  is,  then 
goto  step  5. 

Step4:  Using  the  routine  RETREVE,  find  the  statement  in 

which  the  NAMELIST (I)  is  defined.  If  the 
statement  is  a file  statement,  then  generate  a 


Media  statement. 

otherwise , 

if  the  name  is 

prefixed 

by 

'#L\ 

or  *#E ' , 

then 

generate  an 

interim 

statement 

for  the 

name , 

otherwise , 

generate 

a 

group 

statement 

for 

the  name 

NAMELIST ( I) . 

Use 

the  routine 

GENSTMT  for 

generating  the  dd  statement. 
Step5:  Continue. 
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Step6:  Return. 


8. 1.1.1  The  GENSTMT  Procedure  (Generate  DD  Statement ) . 

The  GENSTMT (N,T, ST, P)  procedure  is  used  to  generate  a 
dd  statement  for  the  name  N,  the  parent  name  of  which  is 
given  by  P.  The  type  of  DD  statement  to  be  generated  is 

given  by  T.  ST  is  'S'  or  *T ' according  as  the  name  N is  a ■ 

source  name  or  a target  name  respectively.  The  functions 
of  the  routine  GENSTMT  ate  summarized  in  the  following: 

(1) .  If  T is  'C',  ’G',  'I',  'L ' or  'B'  then  the  statement 

to  be  generated  is  for  a record,  group,  interim, 
field  or  subscript  name  respectively.  All  these 
types  of  statements  have  the  same  data  area  (DFIELD, 
see  the  description  of  the  extended  data  area  in 
section  6. 1.4. 5).  Therefore,  DFIELD  is  allocated  and 
the  entries  in  it  are  set  to  the  appropriate  default 
values.  A 

(2) .  If  T is  'F*  or  'M',  then  the  statement  to  be 

generated  is  a file  or  media  statement  respectively. 

; 

These  two  types  of  statements  have  the  same  data  area 
(DMEDIA,  see  the  section  6. 1.4. 4).  Therefore,  DMEDIA 
is  allocated  and  the  entries  in  it  are  set  to  the 
appropriate  default  values. 
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(3).  Using  the  routine  STORE,  the  statement  is  stored  in 

i 

the  associative  memory,  using  P as  the  parent  name. 

8.1.2  The  ACRDUP  Procedure  (Update  Dictionary) . 

The  function  of  the  routine  ACRDUP  is  to  update  the  , 

dictionary  and  also  update  some  pointers  in  the  data  area 
of  the  statement.  The  algorithm  A8.3  describes  the 

functions  of  the  routine  ACRDUP.  They  include  the  4 

following  s 

(1)  Adding  the  data  name  itself  to  the  dictionary, 

(2)  Establishing  link  between  the  data  name  and  its 
patent,  and 

(3)  Determininq  the  data  name  as  either  source  or  target, 
if  that  name  is  a descendant  of  a source  or  a target  file 
respectively. 

As  an  example,  if  the  storage  Dointer  represents  the  f 

following  statement: 

* 

A IS  FIELD (B) 

then  the  routine  ACRDUP  does  the  following  functions.  > 

(1)  It  creates  a dictionary  entry  for  A (step  1). 

(2)  It  obtains  the  parent  of  the  data  name  A (the  name  B) 

j 

in  step  2 and  the  storaqe  entry  pointer  to  that  parent 
name  in  step  3.  Note  that  there  must  be  at  least  one  dd 


statement  for  B (If  there  weren't  any,  a group  statement 


j 
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for  B must  have  been  generated  while  executing  the  routine 
GEN$P#Q;  see  the  description  of  algorithm  A8.2). 

(3)  If  there  was  only  one  dd  statement  for  B,  storage 
entry  pointer  of  B is  stored  in  storage-ptr  ->  data-pt  -> 
data-list  ->  oarent-ptr  (see  foot  note(l)  and  also  the 
description  of  common  data  area  in  section  6. 1.4.1). 
Also,  the  list  of  daughter  names  given  by  the  dtr-ptr  of 
the  parent  name  (storage-ptr  of  B ->  dtr-ptr)  is  undated 
(in  the  routine  SETDTR°)  to  include  the  newly  obtained 
daughter  name  given  by  storage-ptr. 

(4)  If,  for  example,  the  data  name  B is  a descendant  of  a 
source  file,  then  A is  also  a descendant  of  that  source 
file,  then  the  bit  storage-ptr  ->  SRC_BY_DDS  is  set  (in 
step  10)  to  indicate  that  this  data  name  is  a source  data 
name. 

The  term  "closest"  used  in  steo  7 is  defined  as 
follows : 

If  there  are  n parent  names  for  the  data  name  and  u(i)  (i 
* 1 to  n)  is  the  user-line#  of  the  i th  parent  name,  then 
the  "closest"  parent  of  the  data  name  is  either: 

(1)  A ->  B ->  C will  be  used  from  now  on,  to  refer  to  the 
variable  C,  based  on  B which  in  turn  is  based  on  A. 
Note  that  this  is  an  extended  PL/I  notation,  and  if 
there  is  no  confusion,  the  above  notation  will  be 
abbreviated  to  A ->  C;  in  other  words,  the  expression 
storage-ptr  ->  data-pt  ->  data-list  ->  parent-ptr  will 
be  abbreviated  to  storage-ptr  ->  parent-ptr. 
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(1)  the  i th  parent  name  such  that 
u(i)  < u and 

V j = 1 to  n,  j / i,  u(j)  < u(i)  or  u(j)  >u 
or 

(2)  the  i th  parent  name  such  that 
u < u(i)  and 

V j = 1 to  n,  j / i,  u(j)  > u (i)  . 

where  u is  the  user-line#  of  the  data  name  given  by 

< storage-ptr . In  the  above  example,  if  the  user-line#  of  A 

is  10,  and  B has  two  dd  statements  with  the  user-line#s  5 


and  15 

respectively. 

then  the 

first  parent 

name 

is 

selected 

as  the  parent 

of  A (in 

step  7) . This 

type 

of 

situation 

can  happen 

as  shown 

in  the  following 

set 

of 

statements : 

1 

1 

STATEMENT 

1 

1 USER-LINE# 

J. 

1 

1 

1 

• 

1 

1 

1 

1 

1 

IB 

1 

IS  GROUP ( Rl ) 

1 

1 5 

1 

1 

1 

1 

• 

1 

| A 

l 

IS  FIELD (B) 

! 10 

1 

1 

1 

I 

1 

IB 

1 

IS  GROUP ( R2 ) 

1 

1 15 

1 

1 

1 

I A 

IS  FIELD (B) 

1 20 

--+ 

1 

1 

1 

In  this  case,  we  can  safely  assume  that  the  second  and  the 
fourth  statements  really  mean: 

A IS  FIELD (Rl .B) 
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and  A IS  FIELD ( R2 . B) 

respectively . This  type  of  resolving  the  ambiguity  is 

precisely  the  function  of  step  7 in  the  algorithm  A8.3. 

The  routines  GETQNM  and  SETDTRP  used  in  the  algorithm 
is  described  in  the  following  sections. 

The  ACRCP ( QNAME , # I ) procedure  is  a local  procedure  that 
eliminates  all  the  storage  entry  pointers  in  DIRTEMP  that 
do  not  contain  the  name,  QNAME.  The  second  argument  is 
decremented  by  the  number  of  storage  entries  that  do  not 
contain  QNAME.  For  example,  if  QNAME  is  A.B  and  a storage 
entry  in  DIRTEMP  represents  the  name  C.A.B.X,  then  QNAME 
is  contained  in  it,  whereas,  if  the  storage  entry 
represents  the  name  C.B.A.X,  then  QNAME  is  not  contained 
in  it,  so,  that  storage  entry  is  deleted  from  DIRTEMP  and 
#1  is  reduced  by  1. 


Algorithms  A8.3,  ACRDUP (storage-ptr ) 

i 

Function:  Adds  the  names  in  the  storage  entry  (given  by 
storage-ptr)  to  the  dictionary. 

Calls:  ACRDO , GETQNM,  RETRNAM , RETREVE,  ACRCP  and  SETDTRP. 
Called  by:  ACRDICT . 

Stepl:  Using  the  routine  ACRDO,  add  the  data  name 

« represented  by  storage-ptr  to  the  dictionary. 

* Step2:  Using  the  routine  GETQNM,  get  the  parent  names,  if 
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i 
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any,  in  the  storage  entry.  If  no  parent  name, 
then  goto  step  9. 

Step3:  Using  the  routine  RETRNAM  and  RETREVE,  get  all  the 

storage  entry  pointers  for  the  parent  name.  Save 
those  pointers  in  DIRTEMP. 

Step4:  Usinq  the  routine  ACRCP,  eliminate  all  those 

pointers  in  DIRTEMP,  for  which  the  parent  name 
represented  by  that  pointer  does  not  match  with 
the  parent  name  obtained  in  step  2. 

Step5:  If  there  is  no  pointer  in  DIRTEMP  then  qoto  step 

9.  If  more  than  one  pointer  exists  in  DIRTEMP, 
then  goto  step  7. 

Step6:  DIRTEMP ( 1 ) gives  the  storage  pointer  to  the  data 

name.  Usinq  the  routine  SETDTRP , establish  the 
links  between  parent  and  daughter  names.  At  end, 
qoto  step  9. 

Step7:  The  data  descriotion  for  this  name  is  ambiquous, 
that  is,  there  are  more  than  one  dd  statements  for 
its  parent.  In  this  case,  select  that  parent 
which  is  "closest"  to  it. 

Step8:  Using  the  routine  SETDTRP,  establish  the  links 

between  the  parent  and  daughter  names. 

Step9:  If  the  storage  entry  represents  a file  name  then 

check  if  it  is  in  a source  file  or  target  file  « 
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statement.  Accordingly,  set  the  SRC_5Y_DDS  or 
TAR_BY_DDS  bit  in  the  storage  entry  of  the  data 
name. 

SteplO:  If  the  stoiage  entry  represnets  a group,  field,  or 
record  name  then  check  if  it  is  a descendant  of  a 
source  or  a target  f ile . Accordingly,  set  the 
corresponding  bit  in  the  storage  entry  of  the  data 
name. 

Stepll:  Return. 

8. 1.2.1  The  GETQNM  Procedure  (Get  Qualified  Name). 

The  function  GETQNM (PTR,X)  returns  the  fully  qualified 
name  in  the  storage  entry  (given  by  the  storage  entry 
pointer,  PTR) , the  prefix  of  which  is  the  same  as  the 
second  argument,  X.  For  example,  if  the  storage  entry 
contains  the  key  names  '$G',  '$GX',  ' # EA  ' , '$PY'  AND 

'#QZ ' , in  that  order  (this  storage  entry  represents  the 
statement:  X IS  GROUP ( Y.Z , (A) ) ) , then  the  following  use  of 
the  function: 

XY  * GETQNM (PTR,  *$P  ') 

will  return  the  qualified  name  'SPY  #QZ  ' in  XY;  and  the 
following  use  of  the  function: 

XY  » GETQNM  (PTR,  '#E  ') 
will  return  the  name  '#EA ' in  XY. 
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8 . 1 . 2 . 2 The  SETDTRP  Procedur e ( Set  Dtr-Par ent  Link ) . 

The  SETDTRP (P,D)  procedure  is  used  to  establish  the 
link  between  the  storage  entry  pointer  P (parent  of  D)  and 
the  storage  entry  pointer  D (daughter  of  P) . As  described 
earlier,  the  storage  entry  pointers  of  all  daughter  names 
of  a data  name  are  maintained  in  a list  given  by  dtr-ptr 
of  the  storage  entry  (see  the  description  of  common  data 
area  in  section  4.5).  The  routine  SETDTRP  adds  the 
storage  entry  pointer  D to  the  daughter  list  of  P such 
that  the  user-line#s  of  all  the  entries  in  the  list  are  in 
ascending  order.  For  example,  if  the  list  contained  2 
data  names  with  user-line#s  5 and  10  respectively,  and  the 
user-line#  of  D is  8,  then  the  storage  entry  D is  added  in 
between  the  two  data  names;  however,  if  the  user-line#  of 
D is  20,  then  the  storage  entry  D is  added  to  the  end  of 
the  list. 

8 . 1 . 2 . 3 The  ACRPTR  Procedure  ( Analyze  PTR  Var iables ) . 

The  ACRPTR (PTR)  procedure  adds  the  exist  names,  length 
names  and  the  key  names  (if  any)  associated  with  the 
storage  entry  (given  by  the  storage  entry  pointer,  PTR)  to 
the  dictionary.  The  algorithm  A8.4  describes  the 
functions  of  the  routine  ACRPTR.  For  example,  if  PTR 
represents  the  storage  entry  pointer  corresponding  to  the 
following  statement: 
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A IS  FIELD (B, (R1,R2) , CHAR ( LI ) ) 
and  the  data  names  Rl,  R2  and  LI  ate  not  descendants  of 
the  same  file  as  that  of  A,  or  the  data  names  Rl , R2  and 
LI  appear  after  the  data  name  A (that  is,  uset-line#s  of 
Rl,  R2  and  LI  are  greater  than  the  user-line#  of  A),  then 
the  following  interim  dd  statements  are  generated: 

Rl  IS  INTERIM (EX 1ST. B. A) 

R2  IS  INTERIM ( EX 1ST. B. A) 

LI  IS  INTERIM ( LEN . B . A) 

If  however,  any  of  the  names  Rl,  R2  or  LI  occurs  before 
the  data  name  A and  is  the  descendant  of  the  same  file  as 
A,  then  there  is  no  necessity  of  generating  a dd  statement 
for  that  name,  because,  such  a definition  can  directly  be 
handled  using  the  "REFER"  option  of  PL/I  Optimizing 
Compiler.  This  is  further  described  in  the  later- 
chapters. 

If  there  is  a key  name  in  the  dd  statement  (storage 
entry  pointer  to  which  is  given  by  PTR) , an  interim 
statement  for  the  pointer  name  is  always  generated. 

The  routine  SRCHFIL  used  in  step  4,  searches  the  length 
or  exist  name  in  the  file  structure  and  returns  'l'B  is 
the  search  is  successful,  ’O'B,  otherwise. 

The  routine  GENQST,  used  in  step  6 of  the  algorithm  is 
described  in  the  following  sub-section. 
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Algorithm:  A8.4,  ACRPTR ( PTR) . 

Function:  Adds  to  the  dictionary  the  lenqth,  exist  and 
pointer  names  in  the  dd  statement. 

Calls:  GENQST,  SRCHFIL. 

Called  by:  ACRDICT . 

Stepl:  Get  all  the  key  names  in  the  storage  entry.  For 

each  key  name  generate  the  following  statement 
using  the  routine  GENQST: 

key-name  IS  INTERIM  (POINTER. aual-name) 
where  qual-name  is  the  qualified  name  that  the 
storage  entry  (given  by  PTR)  represents. 

Step2:  For  each  length  or  exist  name  in  the  dd  statement 

perform  steps  3 thru  7. 

Step3:  If  the  name  is  defined  by  then  goto  step  6. 

Step4:  Using  the  routine  SRCHFIL,  see  if  the  name  is  in 
the  same  file  (as  the  name  qiven  by  PTR) , and  that 
name  (exist  or  length)  occurs  before  it.  If  it 
is,  then  goto  steD  7. 

Step6:  Generate  the  following  DD  statement: 

name  IS  INTERIM  (prefix. qual-name) 
where  "name"  is  the  exist  or  the  length  name,  and 
prefix  is  'EXIST'  or  'LEN'  according  as  "name"  is 
an  exist  name  or  length  name  respectively. 
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"qual-name"  is  the  qualified  name  that  the  storaqe 
entry  (given  by  PTR)  represents.  Use  the  routine 
GENQST  to  generate  the  statement. 

Step7:  Continue. 

Step8:  Return. 


8. 1.2. 4 The  GENQST  Procedure. 

The  GENQST (N , I , P)  procedure  is  used  to  generate  a dd 
statement  for  all  the  names  in  the  qualified  name  N.  I 
specifies  the  type  of  data  name  that  N represents.  It  is 
one  of  the  following  names:  'EXIST',  'LENGTH',  'POINTER', 
'SOURCE',  'TARGET',  'NEW',  'OLD'  or  'SUBSET'. 

The  storage  entry  pointer  of  the  last  name  in  N is 
returned  in  the  third  argument,  P.  Algorithm  A8.5 
describes  the  functions  of  the  nrocedure  GENQST. 

Note  that,  because  the  type  of  data  name  is  known,  the 
parent  of  the  data  name  is  also  known  (which  is  one  of  the 
group  names:  'EXIST',  'LENGTH',  ...  'SUBSET').  Thus,  in 
the  algorithm  A8.5,  it  is  easy  to  search  for  a dd 
statement  of  a simple  name  in  N,  by  searching  the  daughter 
list  of  its  parent  name.  For  example,  if  I = 'EXIST'  and 
N is  'REP1 ' then  the  daughter  list  of  the  data  name 
'$GEXIST'  is  searched  for  the  name  REP1. 
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Algorithm:  A8.5,  GENQST (N, I , P) . 

Function:  Generates  dd  statements  for  the  names  in  the 
qualified  name  in  N,  the  type  of  which  is  given  by 
the  second  arqument. 

Calls:  ACRDO,  GENSTMT  and  SETDTRP. 

Called  by:  ACRPTR. 

Stepl:  Generate  a group  statement  for  the  Ith  data  name, 
if  this  data  name  is  the  first  one  in  that  type. 
Use  the  routine  to  generate  the  dd  statement  and 
the  routine  ACRDO  to  add  the  name  to  the 
dictionary. 

Step2:  For  each  simple  name  in  N perform  steps  3 thru  6. 

Step3:  Check  if  there  exists /a  dd  statement  for  the 
simple  name  (A  dd  statement  exists  if  the  simple 
name  is  contained  in  the  daughter  list  of  its 
parent).  If  there  is  a dd  statement  already,  then 
goto  step  6. 

Step4:  Generate  a dd  statement  for  the  simple  name  usinq 
the  routine  GENSTMT.  Also,  add  the  name  to  the 
dictionary  using  the  routine  ACRDO. 

Step5:  Establish  the  link  between  dauqhter  and  parent 
names  using  the  routine  SETDTRP. 

Step6:  Continue. 
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Step7:  Return. 


8.1.3  The  ACRSTVS  Procedure  (Analyze  Vats,  in  Assettion) . 

The  ACRSTVS (PTR)  procedure  analyzes  the  assertion,  the 
storage  entry  pointer  of  which  is  qiven  by  the  argument, 
PTR.  It  performs  the  following  functions: 

(1).  Obtains  all  the  source  and  target  variables  in  the 
assertion. 

(2) .  If  necessary,  generates  dd  statements  for  the 
data  names  used  in  the  assertion. 

(3) .  Analyzes  the  subscript  of  each  data  name,  and, 
if  necessary,  simplifies  the  assertion  by  reducing 
the  subscript  of  each  data  name  to  the  standard 
form:  "a*I+b",  where  "a"  and  Hb"  are  constants  (or 
simple  variable  names)  and  I is  a subscript  name. 
This  process  of  simplifying  assertions  can  also 
cause  the  generation  of  some  additional  assertions. 
(See  also  the  description  of  the  routine  GET_AE). 

(4) .  Obtains  the  lower  bound,  upper  bound,  and 
increment  for  each  subset iot  of  a data  name.  These 
ate  obtained  from  the  subscript  itself  and  the  range 
specified  in  the  "if-clause"  of  the  assertion. 

The  algorithm  A8.6  describes  the  functions  of  the 
routine  ACRSTVS  in  greater  detail. 
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Each  variable  that  was  found  in  the  assertion  is  stored 
in  a list,  the  format  of  which  is  described  in  section 

6.2.1. 

The  routine  GET_AE  in  stepl  of  algorithm  A8.6,  sets  the 
lower  bound  (lb)  and  upper  bound  (ub)  of  a subscript  to 
the  offset  (off)  specified.  These  values  are  changed  in 
step  6A,  where  the  entries  lb,  ub  and  inc  are  computed 
using  the  routine  MERGSUB,  which  uses  the  information 
specified  in  the  dd  statement  for  the  data  name  and  the 
"if-clause"  of  the  assertion.  The  routine  MERGSUB  is 
described  in  section  8.1. 3. 3. 

Algorithm:  A8.6.  ACRSTVS(S) . 

Function:  Analyzes  the  assertion,  the  storage  entry 
pointer  of  which  is  given  by  the  argument,  S. 

r 

Calls:  GET_AE,  GENSTMT,  SETDTRP , ACRD1 , MERGSUB. 

Called  by:  ACRDICT. 

-L 

Stepl:  Using  the  routine  GET_AE,  get  all  the  source  and 

target  variables  in  the  assertion.  A list  of  all 

! 

the  variables  are  then  created,  and.  the  pointer  to 
that  list  is  saved  in  the  pointer  ”vl-ptr", 
located  in  the  extended  data  area  of  the  assertion 
(see  also  the  section  6. 1.4. 3). 

I 

Step2:  For  each  variable  obtained  in  stepl,  perform  steps 


f 
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3 thru  10. 

Step3:  For  each  simple  name  in  the  variable  name,  perform 

steps  4 thru  9. 

Step4:  If  the  simple  name  does  not  have  a dd  statement, 

then  generate  a dd  statement  for  that  name  using 
the  routine  GENSTMT.  If  the  variable  name 
contains  only  one  simple  name,  then  an  interim 
name  is  generated,  otherwise,  a group  statement  is 
generated  except  for  the  last  simple  name,  for 
which  a field  statement  is  generated. 

Step5:  If  there  is  more  than  one  dd  statements  for  the 

simple  name,  then  select  the  one  with  "matchinq" 
parent  name. 

Step6:  Using  the  routine  SETDTRP , establish  links  between 

the  simple  name  and  its  parent. 

Step6a:  Using  the  routine  MERGSUB,  obtain  the  lower  bound 
(lb),  upper  bound  (ub)  and  increment  (inc)  for  the 
subscrioted  variable. 

i 

Step7:  Using  the  routine  ACRD1,  create  an  entry  for  the 

simple  name  in  the  dictionary. 

Step8:  Set  the  bit  SRC_BY_ASS  or  the  bit  TAR_3Y_ASS 
according  as  the  variable  name  is  the  source  or 

1 

target  of  the  assertion,  respectively. 


Step9:  Continue 


I 

I 

l 

I 
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SteplO:  Continue. 

Stepll:  Return. 


8. 1.3.1  The  GET-AE  Procedure  (Get  Vats,  in  At ith  Expn. ) . 

The  GET_AE (S)  ptocedute  obtains  all  the  variables  in 
the  atithmetic  exptession  of  an  assettion.  The  argument, 
S,  points  to  the  DT  fot  the  arithmetic  expression.  The 
main  functions  of  the  procedure  GET_AE  are  outlined  in  the 
algorithm  A8.7.  In  the  following,  the  algorithm  is 
illustrated  with  some  examples. 

One  of  the  main  functions  of  the  routine  GET_AE  is  to 
simplify  the  assertion  such  that  any  subscript  associated 
with  a variable  name  is  in  the  standard  form:  "a*I+b" 
(Step  4).  For  example,  in  the  following  assertion: 

AB ( 20*A+B+D)  = C(I) 

the  subscript  of  AB  is  not  in  the  standard  form.  This 
subscript  can  be  reduced  to  the  standard  form,  once  the 
data  types  of  the  variable  names  A,  B and  D are  known.  In 
the  following,  three  cases  are  considered. 


Case(l)  If  none 

of 

the  variables 

names  A,  B and  0 

is 

defined 

as 

a subscript. 

then 

the  subscript 

is 

reduced 

to 

the  standard 

form 

by  generating 

the 

following  assertion: 

$10001  « 20*A+B+D 


and  chanqinq  the  original  assertion  to: 

AB ( $10001 ) = C(I) 

(in  this  case,  a =0  and  b = $10001,  in  the 

standard  form  of  the  subscript  expression). 


Case  (2)  If  A was  declared  as  a subscript,  then  the 
following  assertion  is  qenerated: 

$10001  = B + D 

and  the  original  assertion  is  changed  to: 

AB ( 20  *A+$ 10001 ) = C(I) 

Case (3)  If  A and  D were  not  declared  as  subscripts,  but  B 
was,  then  the  following  assertion  is  qenerated: 

$10001  * 20*A+D 

and  the  original  assertion  is  chanqed  to: 

AB  (B+$I0001 ) * C(I) 

The  functions  described  above  are  performed  in  step  4 
of  the  algorithm.  The  general  rules  for  simplifying  a 
subscript  are  given  below: 

(1).  Search  the  derivation  tree  for  a subscript  name.  Any  r, 

name  that  appears  in  the  leaf  of  the  tree  is  a 


subscript  name  if  the  following  conditions  ate 
satisf ied : 

(a).  The  name  is  defined  as  a subscript  (by  a 
subscript  statement)  or  the  name  does  not  have 


\ 


name  is  one  of  the  letters,  'I',  'J',  'K',  'L', 

' M ' , or  ' N ’ . 

(b) . The  name  does  not  have  a subscript. 

Note  that,  if  the  assertion  given  above  is  slightly 
changed  to  the  following: 

AB ( 20*J+B+D)  = C(I) 

the  case(l)  above,  will  generate  the  following 
assertion : 

$10001  = B + D 

and  the  original  assertion  is  changed  to: 
AB(20*J+$I0001)  = C ( I ) 

(2) .  If  a subscript  name  was  found,  then  the  tree  is 

transformed  such  that  the  subscript  name  is  the  left 
most  "handle"  in  the  tree.  For  example,  if  in  the 
subscript,  "20*A+B+D",  B is  found  to  be  the  subscript 
name,  then  the  tree  is  chanqed  such  that  it  now 

represents  the  expression:  "B+20*A+D". 

(3) .  If  no  subscript  name  is  found,  and  the  subscript  is 

not  a constant  (or  a simple  name) , then  an  assertion 

is  generated  as  in  case(l)  above. 

(4) .  Now  the  offset  and  increment  associated  with  the 

subscript  name  is  determined.  If  the  offset  ("a") 
and  increment  ("b")  are  not  constants  (or  simple 
variable  names)  then  appropriate  assertions  are 
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generated  as  in  the  cases  (2)  and  (3)  above. 

Algorithm:  A8.7,  GET_AE(S). 

Function:  Obtians  all  the  variables  in  the  arithmetic 

expression  represented  by  the  argument  pointer,  S. 

Calls:  GENASSI. 

Called  by:  ACRSTVS. 

Stepl:  Obtain  all  the  variable  names  in  the  arithmetic 

expression  by  scanning  the  DT  recursively.  For 
each  variable  name  obtained,  perform  steps  2 thru 
9. 


Step2 : 

Step3: 

Step4 : 


For  each  subscript  of  the  variable  name  perform 


steps  3 thru  8. 

Check  if  the  subscript  is  in  the  standard  form: 

a*I+b,  where  "a"  and  "b"  are  constants  (or  simple 

variable  names)  and  "I"  is  a subscript  name.  If 

the  subscript  is  in  the  standard  form  then  goto 

step  7.  V 

Reduce  the  subscript  to  the  standard  form  by 

generating  additional  assertions.  Use  the  routine 


GENASSI  to  generate  assertions.  This  reduction 


procedure  can  generate  at  most  two  assertions  of 
the  following  form: 
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I 

V 

l 

l 

F ' 

i : 

I 

and  b = <arith-exDression> 

! 

StepS:  If  the  subscript  is  a constant,  then  set  the 

corresponding  bit  (bits (6),  see  the  description  of 
VAR-LIST  in  section  8.1.4)  in  VAR-LIST. 

Step6:  If  the  subscript  is  "misc-data"  (float  data  or  ' 

strinq)  then  set  the  corresponding  bit  (bits (5)) 
in  VAR-LIST. 

4 

3tep7:  Get  the  offset  and  increment  for  the  subscript  and 
save  it  in  OFF  and  INC  (of  VAR-LIST)  respectively. 

Step8:  Continue. 

Step9:  Continue. 

SteplO:  Return. 


8. 1.3. 2 The  GEN AS SI  Procedure  (Generate  Asset  t ions ) . 

The  GENASSI ( P, L)  procedure  is  used  to  generate  an 
assertion  during  the  simplification  of  a subscript.  The 
first  argument,  P,  points  to  the  OT  of  an  arithmetic 
expression,  ol  a diadic  expression  deoendinq  on  the  second 
parameter  L.  Algorithm  8.8  describes  the  functions  of  the 
routine  GENASSI. 
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Algorithm:  8.8,  GENASSI (P,L) . 

Function:  To  generate  an  assertion  for  the  expression 

represented  bv  P and  L.  P points  to  an  arithmetic 
expression  or  a diadic  expression  depending  on  the 
second  argument  L. 

Calls:  STORE. 

Called  by:  GET_AE. 

Stepl:  Get  an  interim  name  (name  prefixed  by  '$1'  and 

followed  by  a 4 digit  number,  example,  $10001)  and 
an  assertion  name  ('SA'  concatenated  with  the 
interim  name) . 

Step2:  Generate  dd  statement  for  the  interim  statement 
obtained  in  step  1. 

Step3:  Allocate  entries  for  "assertion",  "simple 

assertion",  and  "arithmetic  expression"  and  link 
them  appropriately.,,  such  that  the  expression  given 
by  the  argument  P is  the  RHS  of  the  assertion  and 
the  interim  name  obtained  in  step  1 as  the  LHS  of 
the  assertion.  Also,  change  P to  an  arithmetic 
expression  that  consists  of  just  the  interim  name 
obtained  in  step  1. 

Step4:  Using  the  routine  STORE,  store  the  assertion  in 

the  associative  memory.  Also,  stack  the  storage 
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entry  oointer  for  the  assertion,  so  that  it  can  be 
analyzed  using  the  routine  ACRSTVS  (see  the 
description  of  routine  ACRDICT) . 

Step5:  Return. 
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8. 1.3. 3 The  MERGSUB  Procedure  ( Me rge  Subscript  Ranges ) . 

The  MERGSUB ( P ) procedure  obtains  the  subscript  ranges 
(lower  bound,  upper  bound  and  increment)  for  all 
dimensions  of  a variable.  VARLISTP  points  to  the 
structure  VAR_LIST  which  contains  the  information  about  a 
varaible  used  in  an  assertion  (the  root  of  the  DT  of  which 
is  given  by  P) . The  algorithm  A8.9  describes  the 
functions  of  the  routine  MERGSUB.  In  the  following,  the 
routine  MERGSUB  is  illustrated  with  some  examoles. 

In  Stepl  of  the  algorithm,  the  dimension  of  the 
variable  under  considersion  is  obtained  from  the  dd 
statement  of  the  variable  name  and  that  of  its  parent. 
Fot  example,  if  the  following  dd  statements  were  qiven: 

A IS  GROUP ( X , (5) ) 

B IS  FIELD (A, (10) ) 

Then  B is  tecoqnized  as  a variable  of  dimension  2 and  size 
5x10. 

If  the  dimension  computed  in  steel  is  greater  than  the 
number  of  subscripts  specified  for  the  variable  (#0FI  in 
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the  structure  VAR_LIST) , then  some  of  the  subscripts  in 
the  assertion  have  been  ommitted.  In  this  case,  the 
structure  VAR_LIST  is  updated  to  contain  the  maximum 
number  of  subscripts  (determined  in  step  1).  For  example, 
if  VAR_LIST  represents  the  subscripted  variable  B(3),  and 
B was  described  as  of  having  2 dimensions  (with  size  = 
5x10,  as  above),  then  the  structure  VAR_LIST  is  changed  to 
represent  the  subscripted  variable  B(*,3). 

• If  no  range  for  a subscript  was  soecified  in  the 

"if-clause"  of  the  assertion  (that  is,  there  does  not 
exist  any  if-clause  for  the  assertion,  or  the  if-clause 
contains  just  a boolean  expression,  and  not  a subscript 
expression) , then  the  lower  bound  and  upper  bound  for  each 
dimension  of  the  variable  is  obtained  from  the  dimension 
and  size  of  the  variable  obtained  in  step  1. 

If  a range  for  a subscript  is  soecified  in  the 

V 

• if-clause  of  the  assertion,  then  that  range  is  added  to 

the  lower  and  upper  bounds  in  lb  and  ub.  For  example,  if 
a variable  was  subscripted  by  "2*1+5",  then  the  lower  and 
upper  bounds  for  the  corresponding  dimension  would  be  both 
7.  If  the  if-clause  of  the  assertion  in  which  the  above 
subscripted  variable  occurs  is  of  the  form:  "For  I * 1 to 
5",  then  the  lower  and  upper  bounds  (lb  and  ub)  for  the 
corresponding  variable  are  changed  to  7 and  15 
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respectively.  These  computaions  ate  done  in  step  7 of  the 
alqorithm.  The  inclement  for  the  subscript  is  also 
computed  in  step  7 by  multiplying  the  value  specified  in 

r 

the  subscript  with  the  value  qiven  in  the  if -clause.  In 
the  above  example,  the  inclement  is  2 (the  inclement 
specified  in  the  if-clause  is  1).  However,  if  the 
if-clause  in  the  assertion  is  chanqed  to  "For  I = 1 to  5 

BY  2"  then  the  increment  is  4 (note  that,  lb  and  ub  in 
this  case  aie  7 and  13  respectively). 


Alqorithm:  A8.9,  MERGSUB ( ASSPTR , STEPTR ) 

Function:  Obtains  the  loweL  and  upper  bounds  and  the 

increment  associated  with  any  subscript  of  a 
variable  used  in  an  assertion.  ASSPTR  ooints  to 
the  derivation  tree  of  the  assertion  and  STEPTR  is 

I' 

the  storage  entry  pointer  of  the  variable  name  for 
which  the  bounds  and  the  increment  ate  to  be 
supplied. 

Calls: 

Called  by:  ACRSTVS 

Stepl:  Get  the  Dimension,  ID,  of  the  data  name  qiven  by 
the  storage  entry  pointer  STEPTR.  Also,  get  the 
bounds  of  each  dimension  in  the  array  SIZE. 

Step2:  If  ID  < IOFI  (dimension  of  the  subscripted 

1 
I 
I 


I 
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Step3 : 


Step4 : 

Step5: 
Step6 : 


Step7 : 


variable  used  in  the  asserton)  then  goto  steo  11. 
If  #D  = #OFI  then  goto  step  5.  Note  that  the 
external  pointer  VARLISTP  points  to  the  structure 
VAR_LIST  that  describes  the  variable  name  used  in 
the  assertion. 

Create  a new  VAR^LIST  (save  the  pointer  in  TPTR) 
such  that  it  can  accommodate  #D  dimensions.  Also, 
move  the  subscript  information  associated  with  the 
variable  to  the  last  #OFI  dimensions  of  the  newly 
created  VAR  LIST. 

Set  the  default  values  for  the  first  #D-#OFI 
subscripts  (lb  » 0,  ub  * 0 and  subscr ipt-ptr  = 
null) . 

For  I = 1 to  ID  perform  steps  6 thru  9. 

If  the  subscript  name  was  not  specified  ('*'  was 
used),  or  no  range  for  that  subscript  was 
specified  in  the  if-clause,  then  set  the  lower  and 
upper  bounds  (lb  and  ub)  for  the  I th  dimension 
with  the  corresponding  values  from  SIZE,  obtained 
in  step  1. 

If  there  was  a subscript  range  for  the  I th 
dimension,  then  add  the  lower  and  upper  bounds 
specified  in  the  if-clause  to  the  corresponding 
values  in  VAR  LIST.  Also,  obtain  a new  increment 
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foL  the  subscript  by  mu  1 1 iply inq  the  increments 
specified  in  the  if-clause  and  in  the  subscript 
itself. 

Step9:  Continue. 

SteplO:  Go  to  step  12. 

Stepll:  Increase  the  dimension  of  the  data  name  qiven  by 
STEPTR  by  IOFI-ID  and  noto  steD  5. 

Stepl2:  Return. 


8.1.4  Analysis  of  Subset ipt  Statements  ( Pi ocedute  ASUBST) . 

The  ptocedute  ASUBST  is  used  to  analyze  each  subscript 
statement  specified  by  the  user  and  check  its  consistency. 
The  function  of  the  ptocedute  is  to  check  the  subscript 
statements  to  satisfy  the  followinq  rules: 

(1) .  Only  one  statement  can  be  specified  for  each 

subscript  statement.  If  mote  than  one  is  specified, 
then  all  those  statements,  except  one  (arbitrarily 
chosen)  , ate  deleted. 

(2) .  Only  one  subscript  name  is  associated  with  any 

dimension  of  a data  nane.  If  mote  than  one  is 
specified,  then  all  those  statements,  except  one 
(arbitrarily  chosen),  is  deleted.  Note  that  if  a 
data  name  is  of  dimension  > 1,  then  one  subsciiot 
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name  can  be  associated  with  each  dimension  of  that 
data  name. 

(3).  If  any  patent  name  is  specified  for  a subscript 
name,  then  that  parent  name  must  be  a repeating  data 
name.  If  not,  the  parent  structure  of  the  data  name 
is  searched  for  a repeating  data  name.  If  a 
repeating  data  name  is  found,  then  that  data  name  is 
used  as  the  parent  for  the  subcript  name;  otherwise, 
if  the  data  name  is  not  an  interim  data  name,  then 
the  inconsistency  is  reported  as  an  error  to  the 
user.  However,  if  the  data  name  is  an  interim  data 
name,  its  dimension  is  increased  accordingly.  As  an 
example,  consider  the  following  set  of  statements: 

G IS  GROUP ( 10 ) ; 

A IS  GROUP (G); 

B IS  FIELD (A)  ; 

I IS  SUBSCRIPT (B)  ; 

In  the  above  example,  B is  patent  of  I,  but  B is  not 
a repeating  data  name.  However,  its  grand  parent  G 
is  repeating,  therefore  the  procedure  ASUBST  changes 
the  subscript  statement  to: 

I IS  SUBSCRIPT (G) ; 
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8.1.5  The  RSORT  Procedure  (Resolve  as  Source  oi  Tarqet). 


The  procedure  RSORT  is  used  to  resolve  a data  name  to 
he  source,  target,  interim  ol  undate.  This  resolved 
information  is  stored  in  "STATUS-BITS"  of  the  structure 
"data-list"  described  in  section  5. 1.4.1.  The  routine 

RSORT  uses  the  following  rules  for  resolving  the  type  of 
data : 

(1) .  A data  name  is  resolved  as  an  update  data  name  if  it 

is  defined  as  an  update  data  name  (by  including  the 
parent  file  name  in  both  the  source  file  and  target 
file  statement).  In  this  case,  8 th  bit  of 

STATUS-BITS  is  set. 

(2) .  A data  name  is  resolved  as  tarqet,  if  either,  the 

data  name  is  defined  as  tarqet,  or  if  the  data  name 
is  never  used  as  source  in  any  assertion.  In  this 
case,  6 th  bit  of  STATUS-BITS  is  set. 

(3) .  A data  name  is  resolved  as  source,  if  either,  the 

data  name  is  defined  as  source,  or  if  the  data  name 
is  never  used  as  tarqet  of  any  assertion.  In  this 
case,  5 th  bit  of  STATUS-BITS  is  set. 

(4) .  A data  name  is  resolved  as  interim,  if  it  is  used  as 

source  in  one  assertion  and  as  target  in  another. 
In  this  case,  7 th  bit  of  STATUS-BITS  is  set. 

When  a data  name  is  resolved  as  source,  tarqet. 
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interim,  or  update,  all  its  predecessors,  and  the 
successors  of  those  predecessors  are  assumed  to  be  of  the 
same  type.  However,  if  the  resolved  data  type  conflicts 
.with  the  previously  resolved  type,  then  type  that  has 
hiqher  priority  is  used.  The  priority  of  different  types 
are  given  in  the  following: 

UPDATE  (highest) 

INTERIM 

TARGET 

SOURCE  (lowest) 

For  example,  if  A and  B are  descendants  of  a group  G,  and 
if  A is  resolved  as  target,  while,  B is  resolved  as 
interim,  then  both  are  resolved  to  be  interim  data  names. 

8.1.6  The  DUPLUDE  Procedure  (Dupl . Update  Diet  Entries) . 

This  routine  creates  duplicate  storage  entries  for 
update  data  names,  that  is,  for  those  data  names  that  are 
used  both  as  source  and  target.  Note  that  there  does  not 
exist  two  storage  entries  for  each  update  data  name  (only 
duplicate  dictionary  entries).  The  duplicate  dictionary 
entries  are  stored  in  INDEXl  and  INDEX2  of  the  "DATA-LIST" 
structure  described  in  section  6. 1.4.1.  Also,  not  every 
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update  data  name  entry  is  duplicated;  only  those  entries 
that  are  updated  by  some  assertion  are  duplicated.  For 
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example,  consider  the  followinq  set  of  statements: 

SOURCE  FILE:  MASTER; 

TARGET  FILE:  MASTER; 

MASTER  IS  FILE; 

REC  IS  RECORD ( MASTER ) ; 

CUST#  IS  FIELD (REC) ; 

3ALANCE  IS  FIELD ( REC , ...  ); 

• • • 

BALANCE  = BALANCE  + 100; 

In  this  examole,  only  the  name  BALANCE  is  updated. 
Therefore,  there  will  be  two  dictionary  entries  for  the 
name  BALANCE  and  all  its  predecessors  (in  this  case,  REC 
and  MASTER).  The  data  name  CUST#  is  not  updated,  and 
therefore,  only  one  entry  will  be  provided  for  that  name. 
After  the  creation  of  the  precedence  matrix,  the  directed 
qraph  for  the  above  examole  will  be  as  shown  in  Fiqure 


8.4 


Figure  8.4.  Directed  Graph  for  a MODEL  Specification 
Using  an  UPDATE  File. 
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8.2  Generating  the  Precedence  Matrix  ( Pr  ocedute  ACRADJM) . 

In  this  section,  the  Generation  of  orecedence  matrix 
(PM)  is  described.  The  organization  of  the  oiecedence 
matrix  was  described  in  section  8.3.  Note  that  the  PM  is 
not  stored  as  a nxn  matrix,  but  as  a list  of  structures, 
one  list  for  each  dictionary  entry. 

The  Algorithm  A3. 10  describes  the  functions  of  the 
routine  ACRADJM  (see  also,  the  Figure  8.3).  In  the 
following,  the  different  steos  of  the  algorithm  are 
described  with  some  examoles. 

The  different  types  of  precedence  relations  (entries  in 
the  precedence  matrix)  ate  described  in  section  6.5. 

As  an  example,  consider  the  following  set  of  MODEL 

statements: 

AO:  D . A = C.B  + E . A 

Al:  C.8  = E.A 

Assuming  that  no  other  MODEL  statements  were  specified, 
the  data  names  C and  C.B  would  be  resolved  as  interim,  D 
and  D . A as  tarqet,  and  E and  E.A  as  source  in  procedure 
RSORT  (see  section  8.4).  With  these  assumptions,  the 

entries  (lb,Ic),  (la, Id)  and  (Ie,Ia)  are  set  to  2,  2 and  1 

respectively,  where  la,  lb,  Ic,  Id  and  le  represent  the 

dictionary  indices  of  the  data  names  A,  n,  C,  D and  E 
respectively.  Note  that,  qualified  interim  variable  names  » 

• 
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ate  treated  the  same  way  as  the  qualified  target  variable 
names.  rte  do  allow  structured  description  of  interim 
variables,  just  the  same  way  as  a source  or  target  file 
structure . 

All  the  value  dependency  relationships  (type  3 and  type 
4 entries  between  an  assertion  name  and  the  source  and 
targe.t  variables  used  in  that  assertion)  are  entered  in 
step  4 of  algorithm.  The  hierarchical  relationships 
(type-1  and  type-2  entries  between  data  names  and  their 
parents)  are  entered  in  step  5.  The  pointing 
relationships  (types  5,  6 and  7)  are  entered  in  step  6. 

To  illustrate  the  function  of  step  6 in  the  algorithm, 
assume  that  the  MODEL  specification  contains  the  following 
statement : 

A IS  FIELD ((E) ,CHAR (L) ) 

In  this  case,  the  dictionary  contains  entries  for  A,  E and 
L.  The  function  of  step  6 is  to  set  the  entries  (J,I)  and 
( K , I ) to  5 and  6 respectively,  where  I,  J and  K are  the 
dictionary  indices  for  A,  E and  L respectively.  The 
function  of  the  routine  GETFNAM  is  to  obtain  the 
dictionary  index  of  the  storage  entry  that  contains  A and 
E or  A and  L,  while  processing  the  names  E and  L 
respectively. 

If  the  MODEL  specification  contains  subscripted 
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variables,  then  some  additional  tyoe-1  and  type-2 
relationships  are  entered  in  steo  8 usinq  the  routine 
3ETTYPE.  This  routine  also  enteis  type-8  relationships 
between  the  subscripted  data  names  (either  interim  ol 
tarqet) . 

The  routines  GETFNAM  and  SETTYPS  used  by  the  routine 
ACRADJM  aLe  described  in  the  sections  that  follow.  The 
routines  AOJMSET  and  ACRDR  are  explained  in  chanter  6. 

8.2.1  The  GETFNAM  Pi  ocedut  e (Get  St.  Enti_y  of  Name). 

The  GETFNAM ( N , I ) procedure  returns  in  I,  the  dictionary 
index  of  the  data  name  that  deoends  on  the  exist,  lenqth 
ol  pointer  name  represented  by  N.  For  example,  if  N is 
'EXIST. A. E',  then  the  dictionary  index  of  A which  contains 
the  exist  name  'E',  is  returned  in  I.  The  alqorithm  A8.ll 
describes  the  functions  of  the  procedure  GETFNAM . Note 
that  if  an  exist,  lenqth  or  pointer  name  was  specified  in 
a dd  statement  of  a data  name,  then  the  storaqe  entry  of 
that  data  name  contains  that  exist,  lenqth  or  pointer  name 
(prefixed  by  '#E',  '»L'  or  '#K'  respectively)  as  one  of 
the  key  names. 


i 

' 

Alqorithm:  A8.10,  ACRADJM. 

Function:  Generates  the  adjacency  matrix  for  all  the 
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entries  in  the  dictionary. 

Calls:  ADJMSET,  ACRDR,  GETFNAM , SETTYPE,  SETT56. 

Called  by:  MONITOR. 

Stepl:  For  I = 1 to  #DICT  (Number  of  dictionary  entries) 

perform  steps  2 thru  7. 

Step2:  Using  the  routine  ACRDR,  obtain  the  storage  entry 

pointer  of  the  I th  dictionary  entry. 

Step3:  If  the  storage  entry  does  not  represent  an 

assertion,  then  goto  step  5. 

Step4:  For  each  source  variable  in  the  assertion  set  the 

(J,I)  th  entry  (where  J is  the  dictionary  index  of 
the  source  variable  name)  to  3;  and  for  each 
target  variable  name  set  the  (I,J)  th  entry  (where 
J is  the  dictionary  index  of  the  target  variable) 
to  4.  Use  the  routine  ADJMSET  to  set  any  entry  of 
the  matrix.  At  end,  goto  steo  7. 

Step5:  If  the  data  name  has  a parent,  then  set  the  (J,I) 

th  entry  or  (I,J)  th  entry  (where  J is  the 
dictionary  index  of  the  patent  name)  to  to  1 or  2 
according  as  the  data  name  is  a source  or  target 
name  respectively.  If  the  data  type  is  not  known, 
then  set  the  (I,J)  th  entry  to  '2'. 

Step6:  If  the  data  name  represents  length,  exist,  or 

pointer  name,  then  set  (I,J)  th  entry  (where  J is 
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Step6a : 


Step7 : 
Step8 : 


Step9 : 


the  diction a ty  index,  obtained  usinq  the  routine 
GETFNAM , for  the  data  name  which  deoends  on  the 
name  reoresented  by  the  I th  dictionary  entry)  to 
5,  6 or  7 respectively. 

If  the  data  name  is  used  in  specifying  lenath  or 
repetition  of  another  data  name,  then  set  type-5 
or  type-6  relationship  between  the  two  data  names 
usinq  the  routine  SETT56. 

Continue . 

Usinq  the  routine  SETTYPE,  set  tyoe-8  entries  in 
the  dictionary.  These  entries  represent  the 
relationships  between  different  subscripted 
representations  for  a data  name. 

Return. 


Aloorithm:  A3. 11,  GETFNAM ( N , I ) . 

Function:  Obtains  the  dictionary  index  of  the  data  name 
that  depends  on  the  exist,  lenqth  or  pointer  name 
represented  by  N. 

Calls:  RETREVE. 

Called  by:  ACRADJM. 

Stepl:  Initialize  I to  0. 

Step2:  Using  the  routine  RETREVE,  qet  all  the  storaqe 
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entry  pointers  of  the  last  simple  name  in  N 
(prefixed  by  ’#E',  ’#L'  or  '#K'  accordinq  as  the 

first  name  in  N is  ’EXIST',  ’LENGTH'  or  ’POINTER’ 

V 

respectively) . 

Step3:  If  number  of  storage  entries  obtained  in  step  2 is 

0 then  return. 

Step4:  If  only  one  storaqe  entry  was  obtained  in  step  2, 

set  the  dictionary  index  of  that  storaqe  entry  to 

1 and  then  return. 

Step5:  Select  that  storage  entry  that  contains  N (except 
for  the  first  or  the  last  name)  and  set  I- 
accordingly. 

Step6:  Return. 

8.2.2  The  SETTYPS  Procedure  (Set  Type-1 , 2 & 8 Edges) . 

The  SETTYPE  procedure  enters  the  type-1,  type-2  and 
type-8  relations  between  different  subscripted  entries  of 
a data  name.  Typel  relatioshiD  is  established  between  a 
source  data  name  and  all  the  entries  that  represent 
different  subscripted  use  of  the  same  data  name.  Type2 
relationship  is  established  between  different  subscripted 
use  of  a target  name  and  the  target  name  itself.  Type8 
relationship  is  established  between  different  subscripted 
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use  of  an  intetim  or  tarqet  data  name. 

For  example,  if  the  variable  name  A is  a descendant  of 
a source  file,  and  an  element  of  A is  used  in  the 
following  assertion: 

Al:  3 = A ( 10 ) 

then  a type-1  relationship  is  entered  in  the  (I,J)  th 
entry,  where  I and  J are  the  dictionary  indices  of  A and 
A(10)  respectively . As  another  example,  if  the  variable 
name  A is  a descendent  of  a tarqet  file,  and  that  variable 
name  is  used  in  the  following  assertions: 

Al:  A ( 1 ) = 3(1) 

A2 : FOR  I = 2 TO  10  A(I)  = A ( I - 1 > 
then  type-2  relationships  are  entered  in  the  (J,I),  (K,I) 
and  ( L , I ) th  entries,  where  I,J,K  and  L are  the  dictionary 
indices  of  A,  A(l),  A(I)  and  A(I-l)  respectively.  Also, 
in  this  case,  tyoe-8  relationships  are  entered  in  the 
(J,K)  and  (L,K)  th  entries.  Type  8 relations  are 
described  in  the  following. 

As  described  earlier,  every  subscripted  variable  is 
stored  in  an  encoded  form  in  the  structure  VAR_LIST.  The 
encoded  representation  includes  the  lower  bound,  upper 
bound  and  the  increment  associated  with  the  subscript. 
This  information  is  useful  in  deriving  the  tyoe-8 
relationships  between  different  subscripts  of  interim  or 


i 
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target  data  names.  If  a type-8  relationship  exists 

between  two  entries  I and  J,  then  it  means  that: 

(1) .  I th  entry  is  a tarqet  of  some  assertion  (or  it 

represents  a field  in  a source  file), 

(2) .  J th  entry  is  a source  of  some  assertion  (or  it 

represents  a field  in  a target  file)  , and 

(3) .  At  least  one  element  in  the  set  of  elements  of  the 

array  represented  oy  the  subscriot  in  the  I th  entry 
is  contained  in  the  set  of  elements  represented  by 
the  subscriot  in  the  J th  entry. 

In  the  following,  the  representation:  A(l,u,c)  will  be 
used  to  represent  a subscripted  data  name  A,  the  lower 
bound,  upper  bound  and  increment  of  which  are  given  by  1, 
u and  c respectively.  As  an  example,  consider  the 
following  set  of  assertions: 

Al:  A ( 25 ) = 10 

A2:  FOR  I = 26  TO  50  BY  1 A(I)  *A(I-1) 

The  above  set  of  assertions  creates  three  subscripted 
entries  in  the  dictionary,  which  are  qiven  by:  A(25,25,l), 
A(26,50,l)  and  A(25,49,l)  (which  represent  the  subscripted 
names  A(25)  in  assertion  Al,  and  A(I)  and  A(l-l)  in 
assertion  A2  respectively).  The  directed  graph  for  the 
above  set  of  assertions  (which  includes  the  t.ype-8 
relations)  is  given  in  Figure  8.5. 
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The  algorithm  A8.12  summarizes  the  functions  of  the 
routine  SETTYPE.  The  routine  CHKCONT  used  in  step  10  is 
described  in  the  next  section. 
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Algorithm:  A8.12,  SETTYPE. 

Function:  Enters  type-1,  type-2  and  type-8  relations 

between  subscripted  data  names. 

Calls:  CHKCONT 

Called  by:  ACRADJM. 

Stepl:  For  I = 1 to  #DICT  perform  steps  2 thru  13. 

Step2:  If  the  entry  does  not  represent  a name  with 
subscript,  then  goto  step  13. 

Step3:  If  the  I th  dictionary  entry  represents  a source 

data  name  then  set  the  entries  (I,J1),  (I,J2),  ... 
(I,Jn)  to  1,  where  Jl,  J2,  ...  Jn  are  the 
dictionary  indices  of  the  different  subscripted 
representations  of  the  same  name. 

Step4:  If  the  I th  dictionary  entry  represents  a target 

data  name,  then  set  the  entries  (J1,I),  (J2,I), 
...  ( Jn , I ) to  2. 

Step5:  Let  J(l),  J(2),  ...  J(n)  be  the  dictionary 

indices  of  n subscripted  representations  of  the  I 
th  dictionary  entry.  If  n < 2 then  goto  step  13. 


r 
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Step6:  For  k = 1 to  n perform  steps  7 thru  12. 

Step7:  If  J(k)  th  dictionary  entry  is  not  a tarqet  of  any 

assertion,  then  goto  step  12. 

Step8:  For  1=1  to  k-1  & k+1  to  n perform  steDS  9 thru 


Step9:  If  J(l)  th  dictionary  entry  is  not  a source  of  any 

assertion,  then  goto  step  11. 

SteolO:  Using  the  routine  CHKCONT , set  the  adjacency 
matrix  entry  (J(k),J(l))  to  8 if  at  least  one 
element  represented  by  the  entry  J(k)  is  contained 
in  the  set  of  elements  represented  by  J(l). 

Stepll:  Continue. 

Stepl2:  Continue. 

Stepl3:  Continue. 

Stepl4:  Return. 


8. 2. 2.1  The  Diophantine  Equation : a.x+b. y=n. 


A type-8  relation  exists  between  two  nodes  i and  j 
(represented  by  A(1  ,u  ,c  ) and  A ( 1 ,u  ,c  ) respectively, 

i i i 1 j 1 

if  there  exists  at  least  one  set  of  non-negative  inteqers 
(x,y)  that  satisfies  the  following  relations: 

1 +x.c  *1  +y.c  -(1) 

i i j j 


a.-': 


I 

--  I 

( 

I 
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+ X . c 

<=  u 

-(2) 

» 

i 

i 

+ y . c 

<*  u 

-(3) 

r 

j j j 

Ignoring  the  relations  (2)  and  (3),  for  the  moment,  the 

relation  (1)  can  be  rewritten  as: 

x .c  - y . c =1  -1  -(4) 

i j i i 

The  above  equation  is  a Diophantine  equation  of  the  form 
a.x+b.y=n.  In  the  following,  some  of  the  well  known 
results  on  the  solution  of  Diophantine  equation  a.x+b.y=n 
will  be  sketched.  Some  theorems  are  stated  without  proof. 

The  proofs  can  be  found  in  standard  text  books  on  theory 
of  numbers  (GRI54). 

Theorem  8.1. 

The  linear  Diophantine  equation  a.x+b.y=n  has  a 

* t 

solution  iff  the  greatest  common  divisor  (GCD)  of  a 
and  b divides  n. 

As  an  example,  consider  two  nodes  represented  by 
(A, 1,10, 2)  and  A(2,10,2)  respectively.  The  intersection 
of  the  elements  represented  by  these  nodes  is  empty; 
therefore,  the  Diophantine  equation: 


1+m.  2 ■ 2 + n . 2 
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f * 

should  not  have  any  solution.  The  above  equation  can  be 

| 

rewritten  as  follows: 

2.  m - 2 . n = 1 

This  equation  does  not  satisfv  Theorem  8.1,  because 

i 

GCD(2,2)  = 2 which  does  not  divide  1.  Therefore,  there 

does  not  exist  any  solution  for  the  above  Diophantine 

equation.  ^ 

Definition : Two  or  note  integers  are  prime  to  other , or 
relatively  prime  if  their  greatest  common  divisor  is  one. 

For  example,  the  integers  6,  -9,  and  14  are  relatively 

prime.  , 

Theorem  8.2 


If  a and  b are  relatively  Drime  and  x=x,y=y  is  a 

0 0 


solution  of  the  equation  a.x+b.y  = n,  then  all  the 

solutions  of  the  equation  are  given  by  the  equations: 

x = x + k . b and  y = y - k . a 

0 0 

for  all  integral  values  of  k. 

8. 2. 2. 2 A Method  for  Finding  a Solution  of  a.x+b.y=n. 

The  following  method  for  findinq  the  solution  of  the 
Diophantine  equation  is  taken  from  the  book  by  Griffin 
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(GRI54) . 

If  GCD(a,b)  = d and  MOD(n,d)  = 0 then  by  dividing  each 
term  of  the  equation  a.x+b.y=n  bv  d,  we  obtain  an 
equivalent  equation,  that  is,  one  which  is  satisfied  by 
all  and  only  the  solutions  of  the  otiginal  equation. 
Consequently,  it  will  be  sufficient  to  solve  the  equation 
a.x+b.y=n  when  GCD(a,b)  = 1. 

We  ate  actually  interested  in  findinq  one  solution  of 
the  equation  a.x+b.y=n.  Once  a solution  is  obtained. 
Theorem  8.2  can  be  applied  to  get  all  the  solutions  of  the 
equation . 

Case(l).  If  MOD(n,a)  * 0 ther.  n = n .a,  therefore 

0 

y =0  and  x * n is  a solution  to  the  equation. 

0  0 0 

Case(2).  If  MOD(n,a)  is  not  zero,  then  a ^ +1  and 

we  may  suppose  that  1 < | a I < |b|.  Then  there 

exists  integers  q , r , q and  r such  that 

112  2 

b=q  .a  + r for  0 < t < I a | 

1 1 1 

and  n = q .a+r  for  0 < r < |a| 

2 2 2 

Therefore , 

a . x + (q  .a+r  ) .y  » q .a  + r 
11  2 2 
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Since  a solution  of  a.x+b.y=n  exists,  r .y-r 

1 2 

is  a multiple  of  a;  that  is,  r .y+a.z  = r . 

1 2 

If  MOD ( r , r ) =0  then  z = 0 and  y = r / r 
2 1 2 1 

is  a solution  to  the  eauation  r ,y+a.z=r  . 

1 2 


Then  by  substituing  in  the  OLiginal  equation, 

a value  of  x for  the  corresponding  v can  found. 

But  if  MOD ( r ,r  ) is  not  zero,  then  proceed  as 
2 1 


before,  usinq  y and  z as  the  variables  and  l as  the 

1 


divisor  since  r < I a I • Therefore,  we  can  find  integers 
1 


q , r , q and  r such  that 
3 3 4 4 


a = q . r + r for  0 < r < r 

3 13  3 1 


and  r = q . r + t for  0 < r < r 

2 4 1 4 4 1 


Therefore , 


r . y = q . r + r - (q  .t  + r ).z 
1 4 14  3 13 


and  as  above, 


r . z + r . w = r . 
3 14 


If  we  continue  in  this  manner,  we  find  that 
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» 

| a I > L'  > t >O.We  can  see  that  each 

1 3 2s-l 

new  set  of  remainders  has  an  upper  bound  r which  is 

2k-l 

smaller  than  the  upper  bound  of  the  preceeding  set  of 

two.  Within  a finite  number  of  steps  the  process  will 

necessarily  end,  for  some  r will  divide  r . 

2k-l  2k 

Supposing  that  this  haopens  when  k = s,  the  equation 

r .u  + r ,v  * l has  the  solution  v = 0 and 
2s-l  2s-3  2s 

u = m;  where  r = m . r . The  oriainal  variables 
2s  2s-l 

can  then  be  determined  by  substitution. 


8. 2. 2. 3 Map  Vector . 

Using  the  above  method  of  solving  Diophantine  equation, 

we  can  establish  a type-8  relation  between  two  nodes  of  a 

directed  qraph.  Actually,  we  are  interested  in  mote  than 

just  the  type-8  relation.  We  would  like  to  test  if  the 

graph  is  "schedulable"  (further  discussed  in  Chapter  9). 

We  associate  with  each  type-8  edge  a vector,  which  will  be 

called  Map  vector  (M  ),  defined  as  follows: 

ij 

M (k)  * p + 1,  if  there  exists  an  inteqer  p such 

ij 

that:  1 + (k-l).c  » 1 + p.c  > 

i i j j 
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for  all  k in  the  range:  1 <=  k <=  l+|j( u -1  ) /c  ] . , 

i i i 

1 and  p in  the  Lanqe: 

0 <=  p < = |^(u  -1  ) /c  j. 

1 1 i 

= 0,  otherwise. 

The  weight  will  also  be  represented  hv  a 6-tuple: 

1 m x 1 n y 

(x  , x , c , y , y , c ) which  defines  M as 

ij  ij  ij  ij  ij  ii  ij 

follows: 

1 1 

M (x  ) = y , 

ij  ij  ij 

lx  1 y 

M (x  +c  )=v  +c  , 
ij  ij  ij  ij  ij 

lxl  y 

M (x  +2.  c )=y  +2.c  , 

ij  ij  ij  ij  ij 

* 

['• 

m m • 

and  M (x  ) = y 

ij  ij  ij  -1 

1 m x 1 m y 

i The  elements  x , x , c , y , y and  c are 

ij  ij  ij  ij  ij  ij 

obtained  from  the  solution  of  the  DioDhantine  equation 
which  is  described  in  the  next  sub-section. 

i 

To  avoid  confusion  in  the  representation  of  a weight 
vector,  the  above  6-tuple  representation  will  be  replaced  . 

I 
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by  the  following  representation: 

1 m x 1 m y 

M - lx  , x , c ] , [y  , y , c ] 
ij  ij  ij  ij  ij  ij  ij 

As  an  example,  if  map  vector  is  given  by: 

M - (0,1, 3, 5, 7,0) 

ij 

then,  the  6-tuple  representation  for  the  above  set  will  be 

M * [2,5,1] , [1,7,2] 

i j 

Again,  if  there  is  no  confusion,  the  subscripts  in  the 

general  representation  will  be  dropped  as  given  below: 

1 m x 1 m y 

M = [x  , x , c ] , [y  , y , c ] 

ij 


The  map  vector  and  its  significance  in  the  consistency 
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fot  all  integral  values  of  k. 

However , we  want  only  those  values  of  k which  satisfy  the 
following  relations: 


X 

>= 

0 

-(7) 

y 

>= 

0 

-(3) 

l 

+ X 

II 

V 

u 

• 

u 

i f c >0  - ( 9 ) 

i 

i 

i 

i 

1 

+ X 

. c > = 

u 

if  c <0  -(9) ' 

i 

i 

i 

i 

1 

+ y 

II 

V 

U 

• 

u 

if  c >0  - ( 10) 

j 

i 

i 

j 

and  1 

+ y 

. c > = 

u 

ifc  <0  -(10)' 

j 

i 

j 

j 

The  Lange  of  k 

can 

be  determined 

from  the  equations 

thiu  (10)  and 

is  given  for 

4 different  cases  in  the 

following: 

Let  P,  Q,  R,  and  S 

be  defined  by 

: 

0 

P = (1  - u 

+ 

x * c ) 

/ c * 

c 

i 

i 

i 

i 

i 

0 

0 

II 

H-* 

1 

c 

+ 

y * c ) 

/ c * 

c 

i 

i 

j 

i 

j 

0 

R * x / c 

1 

0 

s a y / c 

i 

And,  if  the  lower  bound  and  upper  bound  of  the  range  k 
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ate  given 

by 

I 

k and  k tespectively , then  k and  k 

1 2 

1 2 

ate  defined  by  the  following  equations 

in  four  different 

cases : 

0 

Case  (1)  : 

if 

c >0  and  c >0  then 

i 3 

k 

= MAX (CEIL (P) ,CEIL(Q) ) 

-(ID 

1 

k 

= MIN (FLOOR (R) , FLOOR (S) ) 

-(12) 

V 

2 

Case ( 2) : 

if 

c <0  and  c <0  then 

i j 

, 

k 

= MAX (CEIL (R) , CEI L (S ) ) 

-(13) 

1 

k 

= MIN (FLOOR (P) , FLOOR (Q) ) 

-(14) 

I 

2 

Case ( 3) : 

if 

c <0  and  c >0  then 

• 

i j 

V' 

k 

= MAX (CEIL (S) ,CEI L ( P) ) 

-(15) 

i 

1 

k 

= MIN (FLOOR (R) ,FLOOR(0) ) 

-(16) 

2 

l 

and  finally. 

| 

Case  (4)  : 

if 

c >0  and  c <0  then 

1 

i j 

I 

k 

* MAX (CEIL (R) ,CEIL(Q) ) 

-(17) 

1 

k 

- MIN (FLOOR (3) , FLOOR (P) ) 

-(18) 

9 

2 

1 

• 

I 1 ■ 

1 i 

. 
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Now,  if  the  map  vectoi  M is  given  by: 

i j 

1 m x 1 m y 

M = [x  ,x  ,c  ],[y  ,y  ,c  ] 
ij  ij  ij  ij  ij  ij  ij 

then , 


265 


x y 


so,  if  k = 

k , 

then  c = 

c 

= 1 

-(23) 

1 

2 

i j 

i j 

X 

m 1 

Otherwise , 

c 

= ( x - X 

) / 

k - k 

-(24) 

i j 

ij  i 3 

2 1 

y 

m 1 

and 

c 

= (y  - y 

) / 

k - k 

-(25) 

i j 

i j i 3 

2 1 

8. 2. 2. 5 The  CHKCONT  Procedure  (Check  Containment ) . 

The  function  of  the  CHKCONT  procedure  is  to  check  for  a 
solution  of  the  Diophantine  equation  that  represents  the 
intersection  of  the  sets  of  elements  that  are  represented 
by  two  nodes  of  a variable  name.  If  a solution  exists, 
the  routine  establishes  a type-8  relation  between  the  two 
nodes  and  associates  a map  vector  with  it.  The  alqorithm 
A8.13  describes  the  functions  of  the  routine  CHKCONT. 


Alqorithm:  A8.13,  CHKCONT ( I 1 ,J1 ) . 

Function:  Computes  the  map  vector  asociated  with  the 
type-8  edge  between  the  two  nodes  II  and  Jl. 

Calls:  DIOPH. 

Called  by:  SETTYPE. 

StepO:  If  the  dimension  of  the  variable  name  is  greater 


Stepl : 


Step2 : 


Step3 : 

Step4 : 

Step5 : 
Step6 : 


I 

■■  ■ ■ '■■  ■ — ■ — — j 

* 

t 

l 

l 
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than  one,  then  qoto  steo  6. 

Using  the  routine  DIOPH,  check  if  there  exists  any 
solution  for  the  DioDhantine  equation  (4)  qiven  in 
subsection  8. 2. 2.1.  If  there  is  none,  then 
return. 

If  there  is  one  solution  and  that  solution  is  inO 

and  nO,  then  see  if  any  of  the  followinq  two  * 

relations  are  satisfied: 

• 

mO  < (u  - 1 ) / c 
i i i 

nO<(u  -l)/c 

j j 1 

If  any  one  of  the  above  relations  is  satisfied, 
then  no  solution  exists;  return. 

Compute  the  elements  of  the  map  vector  associated 
with  the  type-8  edge  using  the  equations  (19)  thru 
(25)  in  sub-section  8. 2. 1.4.  • 

Set  type-8  relation  between  the  nodes  II  and  J1 
using  the  routine  ADJMSET . Also,  save  the  mao 
vector  obtained  in  step  3. 

Return. 

I 

Using  the  routine  DIOPH,  check  if  there  exists  a 


J J 


solution  for  each  Diophantine  equation  in  the  set 
of  equations  (26)  qiven  in  section  8.2.3.  If  a 


t 
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solution  does  not  exist  for  any  equation,  then 
return. 

Step7:  For  k = 1 to  d perform  steps  8 thru  9. 

Step8:  If  mO(k)  and  nO(k)  is  a oair  of  solutions  to  the  k 

th  Diophantine  equation  in  the  set  of  equations 
(26) , then  check  to  see  if  these  solutions  satisfy 
the  followinq  two  relations: 

k k k 

mO(k)  < (u  -1  ) / c 
i i i 

k k k 

nO(k)  < (u  - 1 ) / c 

j i j 

If  any  of  the  above  relation  is  satisfied,  then  no 
solution  exists,  so,  return. 

Step9:  Continue. 

SteplO:  Compute  the  map  vector  associated  with  each 
dimension  of  the  type-8  edqe  usinq  equations  (19) 
thru  (25)  of  subsection  8. 2. 1.4. 

Stepll:  Set  the  type-8  relation  usinq  the  routine  ADJMSET. 

Also,  save  the  weighted  threshold  obtained  in  step 

10. 

Stepl2:  Return. 


) 

) 

t 

I 

\ 

f 

f 


i 


r 


i 
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8. 2. 2. 6 The  DIOPH  Procedure. 

The  function  DIOPH  described  in  the  program  P8.1,  is  an 
implementation  of  the  method  of  solving  a Diophantine 

equation  outlined  in  sub-section  8. 2. 1.1.  It  uses  the  < 

function  GCD  to  obtain  the  greatest  common  divisor  of  two 
numbers.  It  also  uses  a recursive  procedure  DIOPHL  which  , 

computes  the  series  of  quotients  and  remainders  and  then  1 

obtains  a solution  for  the  equation. 

8. 2. 2. 7 An  Illustrative  Examole. 


Let 

(1  ,u  ,c  ) = (10, 2, -2) 
i i i 

and 

(1  ,u  ,c  ) = (1,10,3)  . 
j 1 3 

Then  the 

Diophantine  equation  is  qiven  by: 

10-2.x=l+3.y 

r 

i.e. , 

-2.x  - 3 . y = -9 

I 

Hence , 

-2.x  - (2+1) . v = -(2  . 4 + 1) 

\ 

i.e. , 

-y  + 2 . s = -1 

. 

Hence , 

-y  + (1+1) . s = -1 

i.e.. 

s + t = -1 

Therefore,  t * 0;  s * -1;  y = -1;  and  x = 6. 

• 

Therefore,  (6,-1)  is  one  solution  to  the  above  Diophantine 

1 

equation,  and  the  general  solution  is  given  by: 

x « 6 - 3.k 


I 

f 


and  y = -1  + 2.k. 

Usinq  equations  (24)  and  (25),  we  get: 
P = ( 10  — 2+6*  — 2) /* 6 
= 4 /6. 

Q = (1-10+  -1*3) /-6 

= 2. 

R = 6/3. 

S = 1/2. 

k = MAX (1,1)  = 1. 

1 

and  k = MIN(2,2)  = 2. 

2 


Now,  using  equation  (20),  we  oet: 
m 

x =6-  1.3  +1=4 

i j 
1 

and  x = 6 - 2.3  +1=1. 

. i j 

Now,  usinq  equation  (21),  we  get: 
1 

y = -1  + 2.2  +1=4 

ij 

m 

and  y =-1+2. 1+1*2. 

ij 


f ' 

' 


And,  finally,  using  equations  (24)  and  (25),  we  get: 
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x 


c 

= 

3/1  = 3 

i j 

y 

and  c 

_ 

-2/1  = -2. 

i j 

Therefore 

, the  map  vector  between  the  nodes  i and  j 

is  given 

by : 

M 

= 

• 

CNI 

1 

CN 

* 

Tt’ 

f—d 

i j 

Pictor ially, 

the  above  two  nodes  can  be  represented 

follows : 

(4, 

0, 

0,  2,  0)  - Map  Vector 

(10, 

X 

8, 

\ 

6,  4,  2)  - Node  i 

/ 

(i. 

¥ 

4, 

7,  10)  - Node  j 

The  arrows  from  node  i to  node  j indicate  the  maopinqs 
between  the  elements  of  the  two  nodes.  If  p th  element  of 
node  i is  mapped  into  the  q th  element  of  node  j,  then  the 
p th  element  of  the  map  vector  is  equal  to  q.  However,  if 
the  p th  element  of  node  i is  not  in  the  set  represented 
by  the  node  j,  then  the  corresponding  element  of  the  map 


vector  is  zero 
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i 

) 

i 

| 
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Program:  P8.1. 


DIOPH:  PROC (A,B,N,X0,Y0)  RETURNS (BIT ( 1 )) ; 

/*  FUNCTION:  OBTAINS  ONE  SOLUTION  TO  THE  DIOPHANTINE 
EQUATION: 

A*X  + B*Y  = N 

IF  THERE  EXISTS  A SOLUTION,  THE  VALUES  ARE  RETUREED  IN 
XO  AND  YO.  IF  NO  SOLUTION,  'O'B  IS  RETURNED  */ 

DCL  (A , X , N , XO , YO , D)  BIN  FIXED; 

D = GCD (A ,B) ; /*GET  THE  GREATEST  COMMON  DIVISOR*/ 
IF  D > 1 THEN  DO; 

IF  MOD (N , D)  = 0 THEN  /*  DOES  NOT  SATISFY 
THEOREM  1.*/ 


RETURN ('O'B); 

/♦SCALE  DOWN  THE  EQUATION*/ 

A = A /D;  B = B / D;  N = N /D; 
END; 

IF  MOD (N,A)  = 0 THEN  DO; 

XO  * N / A;  /*SOLUTION  IS  FOUND*/ 
YO  * 0;  RETURN ( '1  'B) ; 

END; 

IF  ABS(A)  > ABS(S)  THEN  DO; 


CALL  DIOPHL (B,A,N, YO) ; 
XO  * (N=8*Y0)/A; 

END  t 


Ill 


I 

I 

f 

t 

l 

r 

r 

> 

ELSE  DO; 

CALL  DIOPHL (A,8,N,X0)  ; 

YO  = (N-A*X0)/B; 

END; 

RETURN  ( *1 *B)  ; 

DIOPHL:  PROC ( A , B , N , XO ) RECURSIVE; 

DCL  (A,3,N,X0,Q1,Q2,R1,R2)  BIN  FIXED; 

/* IT  IS  ASSUMED  THAT  |A|  > |B|  */ 

Q1  = B / A; 

R1  = B - Q1  * A ; 

02  = N / A; 

R2  = N - Q2  * A; 

IF  MOD ( R2 , Rl ) = 0 THEN  DO; 

/♦FOUND  A SOLUTION*/ 

XO  = R2/R1;  RETURN; 

END; 

* 

CALL  DIOPHL ( Rl , A , R2  , XO ) ; 

XO  = (N  - B * XO)  / A; 

RETURN; 

END  DIOPHL; 

GCD : PROC (A, B)  RETURNS (BIN  FIXED); 

DCL  (M,N,A,B,R)  BIN  FIXED; 

/♦RETURNS  GREATEST  COMMON  DIVISOR  OF  A AND  B */ 

M * A;  N = B;  * 

I 
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I 

I 

) 

( 

i 

\ 


LO:  R = MOD (M,N) ; 

IF  R = 0 THEN  RETURN (N); 
M = N;  N = R;  GOTO  LO; 

END  GCD ; 

END  DIOPH; 


8. 2. 2. 8 Variables  of  Dimension  > 1. 


In  the  previous  sections,  only  the  variables  of 

dimension  one  were  considered.  If  a variable  is  of 

dimension  d,  where  d is  greater  than  1,  and  if  a node  i in 

the  directed  graph  represents  a subscripted  use  of  that 

variable,  then  the  notation; 

111222  ddd 

A[1  , u , c ] [1  , u , c ] ...  [1  , u , c ] 
iiiiii  iii 


will  be  used  to  represent  the  set  of  elements  contained  in 

r 

node  i.  A type-8  relationshio  between  two  such  nodes  can 
be  established  if  there  exists  at  least  one  solution  for 
each  of  the  following  Diophantine  equations: 


1 

1 

1 

1 

1 

1 

m 

. c 

- n . 

c 

* 1 - 

1 

i 

i 

1 

1 

j 

i 

2 

2 

2 

2 

2 

2 

m 

. c 

- n 

c 

- 1 - 

1 

i 

i 

1 

j 

j 

i 

• 

• • • 

iii 
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d d d d d d j 

m.c-n.c=l-l  I 

i i j j j i ' 

12  d 1 2 d 

where  m , m , ...  , m and  n , n , ...  , n are  the 

i i i i j i 

variables  in  the  equations. 

The  solution  to  each  of  the  above  diophantine  equations 

can  be  denoted  by  the  6-tuole  Map  vector  described 

in  section  8. 2. 1.3.  However,  to  extend  the  notation  to  a 

general  case,  the  map  vector  corresponding  to  the  k th 

dimension  will  be  denoted  by: 

k k k k k k 

Ixl  , xm  , cx  , yl  , ym  , cy  ] 

As  an  example,  the  map  vector  associated  with  the  type-8 

edge  between  the  nodes  2 and  3 in  Figure  8.6  is  given 

by : 

11,4,1,1,4,1]  (1,3,1,2,4,11 
and  that  between  nodes  5 and  2 is  given  by: 

(1,3,1,2,4,11  11,1,1,1,1,1] 

8. 2. 2. 9 A Special  Case  of  Subscripted  Variables. 

The  general  representation  of  a subscripted  variable 


node  which  is  given  by 
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111222  d d d 

A(l»UfC][l(UfC)  •••  [ 1 , U , C ] 

iiiiii  iii 

for  a variable  of  dimension  works  very  well  when  an 
assertion  maps  a set  of  elements  of  one  dimension  of  a 
source  variable  to  a set  of  elements  in  the  same  or 
another  dimension  of  the  target  variable.  However,  in 
some  cases,  a set  of  elements  of  one  dimension  in  a source 
variable  may  be  mapped  to  a set  of  elements  distributed 
among  more  than  one  dimension  of  the  target  variable. 
Following  assertion  is  an  example  of  such  a case: 

FOR  I = 2 TO  4 A (I, I)  = A (1,1) 

In  the  above  assertion,  the  target  variable  represents  the 


elements  A ( 2 , 2) , 

A(3,3)  and 

A (4, 

4)  which  cannot 

be 

represented  using 

the  general 

form 

above.  However, 

in 

such  cases,  we  can  merge  the 

two 

dimensions  into 

one 

dimension.  For  example,  if  the  dimension  of  the  variable 
is  2 and  both  dimensions  use  the  same  subscriot  name,  as 
in  the  avove  assertion,  then  the  merged  representation  of 

the  variable  is  given  by: 

$ 


2 


Figure  8.6.  Directed  Graph  for  the  Assertions: 

AO:  A( 1, 1)  - 1; 

Al:  FOR  I - 1 TO  4 & J - 2 TO  4 
A(I,J)  - A(I, J-l)*2 ; 

A2 : FOR  I - 2 TO  4 A( 1,1)  - A(I-1, 4) ; 


- Ill  - 


1 • 1 ' 1 • 

A [ 1 , u , c ] 

i i i 


where : 


1 = (1  - 1)  . x + 1 


u = (u  - 1)  . x + u 


1 ' 1 

C = C .X  + c 

i i j 

and  x is  the  maximum  number  of  elements  in  the  first 
dimension  of  the  variable. 

For  example,  the  variable  name  A(I,I)  will  be 
represented  by  A[6,16,5]  when  the  variable  size  * 4x4  and 
I varies  from  2 to  4,  as  in  the  above  assertion.  Now  the 
CHKCONT  procedure  can  be  extended  to  check  the 
intersection  of  a normal  representation  of  a variable  name 
with  another  merged  representation.  This  extension  will 
be  illustrated  using  the  example  in  the  directed  graph  of 
Figure  8.7. 

If  the  node  (2)  was  represented  normally,  then  the 
Diophantine  equations  1 and  2 in  the  set  of  equations  (26) 
would  have  been  used  to  check  the  containment.  The  same 
concept  is  extended  by  using  the  Diophantine  equation  with 
more  than  one  variable.  In  other  words,  there  exists  a 


3 


A(2,4,l)  (1,1,1) 

(1) 


Figure  8 . 


CL 

4 

OA(6, 16, 5) 
(2) 


7.  Directed  Graph  for  the  Assertions: 
Al:  FOR  I ' 2 TO  4 A(I,I)  - A(1,I); 
A2:  FOR  I - 2 TO  U J - 2 TO  4 
B(I, J)  - A(I, J) ; 
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type-8  edge  between  the  nodes  (2)  and  (3)  if  there  exists 
a solution  to  the  DioDhantine  eauation: 

6 + 5 . x * (2-1  + y.l).a  + 2 + z.l 
where  a is  the  size  of  the  last  dimension  of  A,  which  is  4 
in  the  oresent  example.  The  above  equation  is  a 

Diophantine  equation  in  3 variables  x,  y and  z,  and  can  be 
rewritten  as: 

5.x  - 4.y  - z * 0 

In  general,  if  two  dimensions  dl  and  d2  (such  that  d2  > 
dl)  of  a variable  node  i use  the  same  subscript  name,  and 
all  the  subscript  names  corresponding  to  the  different 
dimensions  of  node  j are  unique,  and  the  node  i is  a 
type-8  predecessor  of  node  j,  then  the  Diophantine 
equation: 

dl'  dl*  dl  dl  d2  d2 

1 +c  . x * (1  -1  + c . y).a  +1  +c  .z 

i i j j rfld2  j j 

will  be  used  to  establish  type-8  relation  between  the 

nodes  i and  j. 

a used  in  the  above  equation  is  a constant  defined 
dld2 

as  follows: 

a - 1 if  dl»d2, 
dld2 

- s .a  for  all  dl<d3<«d2 

d3  did  3 
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where  s is  the  size  of  the  d3  th  dimension  of  the 
d 3 

variable. 

The  same  concept  can  be  applied  if  more  than  two 
dimensions  of  a variable  represented  by  the  node  i use  the 
same  subscript  name;  and  also,  if  such  a node  exists  in 
the  head  and/or  tail  of  a type-8  edge.  A solution  of  a 
Diophantine  equation  of  3 variables  can  be  obtained  using 
standard  methods. 

There  is  an  alternate  method  to  find  the  type-8 
relationship  between  the  special  nodes  described  above. 
This  method  is  sketched  in  the  following  steps. 

Stepl:  Let  i be  a target  variable  node,  all  the 

dimensions  of  which  use  unique  subscript  names. 
Step2:  Let  j be  a source  variable  node,  the  p th  and  q th 

dimensions  of  which  use  the  same  subscript  name. 
Also,  assume  that  p < q. 

Step3:  There  exists  a tyoe-8  relation  between  i and  j if 

there  exists  a solution  for  the  Dioohantine 
equation: 


• ■ ' ” f — ■ ’ 

1 

i 

1 

( 

1 

1 

l 

: 
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• 

1 

+c.x*l+c.y  -(27) 

i 

i j i 

where 

- 

P a q 

1 

= (1  - 1)  . S +1 

i 

i i i 

p q q 

u 

= (a  - 1)  . S + u 

i 

i i i 

| 

q 

\ 

c 

= c 

| 

• 

i 

i 

p q q 

i 

1 

= (1  - 1)  . S +1 

j 

i i i 

p q q 

i 

( 

u 

= (u  - 1)  . S + u 

j 

i i i 

t 

p q q 

c 

= c . S + c 

4 

• 

j 

j i j 

and 

r 

• 

q 

q 

q q 

S 

= u 

1 4-  C 

•- 

i 

i 

i i 

As 

an  example,  consider  the  node  i (denoted  by: 

A(l,5 

,1) (1,5, 

2)  ) representinq  the  set  of  elements: 

A ( I , J) 

for  I * 1 to  5 & J * 1 to  5 by  2; 

i 

and  the  node 

1 (denoted  by  A(2,4,l)(2,4,l)  ) representing 

1 

the  set  of  elements: 

j • 

J 

A(I,I) 

for  I ■ 2 to  4, 

1 
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Assuminq  that  A is  of  size  5x6,  we  can  see  that  the 
node  i represents  the  underlined  elements  in  the  following 
matrix: 


1 2 3 4 5 6 
7 8 7 10  IT  12 
l7  14  15  16  17  18 
T7  20  21  22  75  24 
27  26  77  28  77  30 


whereas,  the  node  j represents  the  underlying  elements 
in  the  following  matrix: 


1 2 3 4 5 6 

7 8 9 10  11  12 

13  IT  15  16  17  18 

19  20  IT  22  23  24 

25  26  27  27  29  30 


Using  the  set  of  equations  (28),  we  get: 


q\ 

S = 5 

i 

l = o 

i 

u * 4 

i 

c * 2 

i 

1*1 

j 

u * 3 

j 


1 + 2 = 6 

6 + 1 = 1 

6 + 5 = 29 


6+2  = 8 

6 + 4 = 22 


- IITTI 


And  the  Diophantine  equation  (27)  becomes: 

1 + 2.x  = 8 + 7.y 

Using  the  method  of  solution  described  earlier,  we  can 
get  the  map  vector  between  the  two  nodes,  and  is  qiven  by: 
W = [8,15,1,2,4,2] 

i j 

q 

Note  that  S used  in  the  set  of  equations  (28)  is  a 
i 

fictious  size  of  the  q th  dimension  of  the  variable.  This 
size  is  used  to  obtain  a linear  transformation  of  the 
elements  represented  by  the  two  dimensions  p and  q of  the 
variable  into  one  dimensional  array.  That  is  why  we  were 
in  a position  to  use  the  Diophantine  equation  of  only  two 
variables  to  derive  the  type-8  relationship. 


t 
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CHAPTER  9 

THE  COMPLETENESS  AND  CONSISTENCY  ANALYSIS 

In  this  chapter,  the  completeness  and  consistency 
analysis  of  MODEL  specification  will  be  described.  An 
overview  of  the  completeness  and  consistency  analysis 
phase  of  the  MODEL  processor  is  shown  in  Figure  9.1.  The 

completeness  criteria  consists  essentially  of  the  \ 

requirement  that  the  directed  qraph  (that  represents  the 
MODEL  specification)  is  well-formed  in  the  followinq 
sense: 

(1) .  Each  row  and  column  in  the  precedence  matrix  must 

have  at  least  one  entry  of  a precedence  relationship 
with  the  followinq  exceDtions:  (a)  Rows  representing 
the  source  media  statements  may  not  have  any  entry 

and  the  columns  representinq  target  media  statements  * j 

I 

may  not  have  any  entry.  (b)  Rows  representing 
source  field  names  may  not  have  any  entry  (when  the 

y\ 

field  is  never  used  to  define  any  other  varaible).  i 

(2) .  If  a node  represents  a set  of  elements  of  an  array 

and  that  node  is  used  as  source  of  an  assertion, 
then  the  union  of  the  set  of  elements  represented  by 
its  predecessor  nodes  must  be  the  same  as  (or  a 
superset  of)  the  set  of  elements  represented  by  the 
node. 
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Figure  9.1.  Overview  of  Phase-3  of  MODEL  Processor 
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(3).  Each  repeatinq  data  name  must  be  subscripted  by  an 
appropriate  number  (equal  to  the  dimension)  of 
subscripts . 

The  consistency  criteria  ascertains  that  the  directed 
graph  is  sequenceable  in  the  following  sense: 

(1) .  There  must  not  be  any  cycles  in  the  specification. 

That  is,  no  node  must  be  its  own  predecessor,  either 
directly,  or  indirectly. 

(2) .  If  iterations  are  present  in  the  specification,  then 

the  beginning  and  end  of  each  iteration  must  be 
properly  structured. 

(3) .  If  the  elements  of  an  array  ate  recursively  defined, 

then  that  recursion  must  be  sequenceable. 

Section  9.1  describes  the  algorithms  used  in  the 
completeness  analysis  of  then  MODEL  specification.  The 
MODEL  statements  are  represented  in  a special  type  of 
directed  graph,  called  array  graph,  for  ease  of  analysis. 
Section  9.2  describes  the  concept  of  array  graphs  and  the 
results  on  the  sequenceabil ity  of  array  graphs.  Section 
9.3  describes  the  algorithms  used  in  the  consistency 
analysis  of  the  MODEL  specification.  Also,  Fiqures  9.2 
and  9.3  give  the  functional  overview  of  the  completeness 
and  consistency  analysis  procedures  respectively.  Some 
examples  are  described  in  section  9.4. 


Figure  9.2.  Overview  of  Completeness  Analysis. 
(Procedure  RESIC) . 
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Figure  9.3.  Overview  of  Consistency  Analysis. 
(Procedure  RICONS) . 
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9.1  Completeness  Analysis. 

This  section  describes  the  algorithms  used  in  the 
completeness  analysis  of  *IODEL  specification.  As 
mentioned  in  the  beginninq  of  this  chapter,  completeness 
analysis  can  be  divided  into  three  Darts.  The  first  part 
checks  for  the  completeness  of  the  precedence  matrix 
(Procedure  RESIC)  , and  the  second  part  checks  that  every 
element  of  an  output  array  is  defined  (Procedure  RESIC8) . 
In  both  cases,  the  incompleteness  is  resolved  by 
generating  appropriate  assertions  or  data  description 
statements. 

The  third  part  checks  for  completeness  in  subscript 
specification.  If  a data  name  is  repeating,  but  no 
subscript  (or  not  enough  subscript)  is  specified,  then 
default  subscripts  ('*')  are  supplied. 

In  the  following,  the  above  three  parts  are  described 
in  greater  detail. 

9.1.1  Incompleteness  in  Precedence  Relations . 

For  a directed  graph  to  be  complete,  it  must  be  well 
formed  in  that  there  must  be  an  entry  in  each  row  and 
column  of  the  precedence  matrix,  except  in  the  following 
special  cases: 

(1).  A column  that  represents  an  "input  node"  need  not 
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contain  any  entry.  The  input  nodes  are  those  nodes 
that  represent  (a)  source  media  statements,  and  (b) 
assertions  that  define  variable  names  by  constants. 

(2) .  A tow  that  represents  an  "outout  node"  need  not 

contain  any  entry.  In  MODEL,  the  nodes  that 
represent  target  media  statements  are  the  only 
output  nodes. 

(3) .  A row  that  represents  a source  field  name  (or  an 

interim  data  name)  need  not  have  any  entry.  In  this 
case,  the  particular  name  is  never  used  to  define 
any  other  name. 

If  the  above  completeness  criteria  are  not  satisfied, 
an  appropriate  data  description  statement  or  an  assertion 
is  generated  according  to  the  following  rules: 

(1).  If  the  node  under  consideration  reoresents  a file, 
record,  group  or  field  data  name,  and  there  does  not 
exist  a parent  for  that  data  name  (that  is,  if  the 
column  or  row  of  that  data  name  does  not  contain  any 
entry  according  as  that  data  name  is  a source  data 
name  or  target  data  name  respectively) , then  a 
corresponding  data  description  statement  is 


i 


- 291  - 


generated  as  given  by  the  following  table: 


"T 


1 

1 

WHEN  THE  DATA  NAME 

IS  A 

I THE  GENERATED  STATEMENT 

1 

IS  | 

1 

T 

1 ' 

1 

1 

SOURCE 

FIELD 

I $ INREC  IS  RECORD; 

1 

1 

SOURCE 

GROUP 

| $INREC  IS  RECORD; 

1 

1 

TARGET 

FIELD 

I $OUTREC  IS  GROUP; 

1 

1 

TARGET 

GROUP 

| $OUTREC  IS  GROUP; 

1 

• 

| SOURCE  RECORD 

1 SYSIN  IS  FILE; 

I TARGET  RECORD 

| SYSPRINT  IS  FILE; 

I FILE  (SOURCE  OR 

1 

• 

I TARGET) 

1 

I SDISK  IS  MEDIA 

1 

Also,  the  data  name  in  the  generated  statement  is 


assumed  to  be  the  parent  of  the  data  name  under- 
consideration.  The  precedence  matrix  is  also 
updated  to  include  the  parent-daughter  relationship 
resulting  from  the  generated  statement. 

(2).  If  the  node  under  consideration  represents  an 

interim  data  name  or  a tarqet  field  name,  and  if 
there  does  not  exist  any  entry  in  the  column  of  that 
node,  then  an  assertion  is  generated  according  to 
one  of  the  following  rules: 

(a).  If  there  exists  a similarly  named  source  or 
interim  data  name,  then  that  name  is  used  to  define 
the  data  name  usinq  an  assertion.  For  examDle,  if 
the  node  under  consideration  reoresents  the  field 
name  "REPORT . TRANS # " , and  there  exists  another  field 
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name  "TRANS#"  in  a source  file  "TRANSACTION",  then 
the  assertion 

REPORT. TRANS#  = TRANSACTION . TRANS # ; 
is  generated. 

(b) . If  the  incompleteness  could  not  be  resolved  by 
(a)  above,  then  one  of  the  following  two  assertions 
is  generated  according  as  the  data  name  is  defined 
to  be  of  type  arithmetic,  or  character  respectively: 
<datname>  = 0; 

<data-name>  = ' 

There  are  two  exceptions  to  the  rule  (a)  above.  These 
are : 

(i) .  If  the  data  name  is  "FILLER",  then  the  rule  (b)  is 

used  even  though  there  may  be  a similar  source  data 
name. 

(ii) .  If  the  dimension  of  the  data  name  under 

consideration  is  smaller  than  the  similarly  named 
source  data  name,  then  the  reduction  function  ANY  is 
used  to  select  a particular  element  of  source  data 
name.  For  example,  for  the  following  set  of 
statements : 

TRANS  IS  FILE; 

PRODUCT  IS  RECORD (TRANS  ,(*)); 

TRANSACTION  IS  GROUP (PRODUCT, (*)) J 

PRODUCT_NUMBER  IS  FIELD (TRANSACTION ) ; 
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REPORT  IS  FILE; 

REPl  IS  RECORD (REPORT, (*) ) ; 

PRODUCT_NUMBER  IS  FIELD (REPl ) ; 

the  followinq  assertion  is  generated: 

REPl. PRODUCT  NUMBER  = ANY (TRANS . PRODUCT  NUMBER); 


The  Dtocedures  GIMPA  and  GIMPI  are  used  to  perform  the 
functions  of  rules  (a)  and  (D)  respectively. 

9.1.2  Incompleteness  in  Computing  Elements  of  an  Array. 

If  an  array  is  defined  as  a target  field  or  interim, 
then  there  must  exist  an  assertion  to  define  every  element 
of  that  array.  If  not,  the  specification  is  incomplete. 
Because,  the  nodes  of  our  directed  graph  represent  a set 
of  elements  of  an  array,  such  incompleteness  cannot  be 
detected  by  analyzing  the  entries  in  the  precedence 
matrix.  Therefore,  some  other  procedure  must  be  adopted. 
The  Procedure  RESIC8  is  used  for  this  nurpose.  The 
functions  of  this  procedure  is  summarized  in  the 
following : 

Stepl.  Obtain  all  the  type-8  predecessors  for  each  node 
in  the  directed  graph  (using  the  routine  OBTf P) . 
Step2:  For  each  node,  i,  in  the  qraph,  perform  steDS  (3) 

thru  (4) . 

Step3:  Find  the  union,  U,  of  the  set  of  elements 

represented  by  the  type-8  oredecessors  of  node  i. 
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Subtract  CJ  from  the  set  represented  by  the  node  i 
(using  routine  RANGSR) . 

Step4:  If  the  resulting  set,  R,  obtained  in  steo3  is  not 

empty,  then  those  elements  in  R aie  not  defined  by 
any  assertion.  Therefore  using  the  procedure 
GAFSUBS , generate  a default  assertion  to  define 
every  element  in  R. 

The  routine  OBT#P  used  in  steol  above  comoutes  some 
vectors  that  are  used  later  on  during  the  consistency 
analysis  phase  (see  section  9.3.2).  Some  of  these  vectors 
are:  # PRED  (#  of  predecessors),  IPRED8  (#  of  type-8 
predecessors),  #SUCC8  (#  of  type-8  successors),  PREDL  (a 
list  of  all  predecessors),  PRED8L  (a  list  of  all  tyoe-8 
predecessors) , and  SUCC8L  (a  list  of  all  type-8 
successors).  The  use  of  these  vectors  is  described  in 
section  9.3.2. 

To  illustrate  the  functions  of  the  procedure  RESIC8, 
consider  the  following  set  of  statements: 

A(l)  » 0; 

FOR  I * 3 TO  9 BY  2 A(I)  * A(I-2)> 

A IS  INTERIM ( 10)  ; 

In  the  above  example,  the  elements:  A(2.I),  1*1  to  5, 
are  not  defined,  and  therefore,  the  following  assertion  is 
generated  by  the  system: 
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FOR  I = 1 TO  5 A (2*1)  =0; 

9.1.3  Completeness  Analysis  of  Subset ipts . 

Basically,  the  completeness  analysis  of  subscripts 
consists  of  the  following  steps: 

(1) .  Obtain  the  tange  and  dimensionality  of  each  source, 

target  and  interim  data  names  and  then  propagate 
them  to  the  assertions  that  use  them  (Procedure 
GRNGDIM) . 

(2) .  Identify  the  "fot-each"  names  (or  '**)  used  in  each 

assertion  with  one  of  the  supplied  subscript  names, 
or  generate  a new  subscript  statement  for  that 
"for-each"  name  (Procedure  IFENAMS) . 

(3) .  Obtain  a list  of  all  data  names  that  use  a subscript 

name  and  obtain  a parent  for  that  subscript  name 
from  one  of  the  names  in  the  list  (Procedure 
CSUBLST) . 

(4) .  Analyze  the  list  obtained  in  step  (3)  for  redundancy 

& inconsistency  (Procedure  CSUBRED) . 

(5) .  Supply  "for-each"  expressions  for  each  assertion 

that  uses  repeating  data  name  (Procedure  SUPFORE) . 
The  procedures  that  perform  the  above  functions  are 
described  in  the  following. 
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9. 1.3.1  The  Procedure  GRHGDIM  (Get  Ran^e  and  Dimension)  . 

This  procedure  obtains  the  Lange  and  dimensionality  of 
every  data  name  in  the  dictionary.  If  the  obtained 
dimension  is  different  from  the  number  of  subscripts  used 
for  the  same  name  in  any  assertion,  then  the  assertion  (ol 
the  data  description  statement)  is  modified.  For  example, 
consider  the  following  set  of  statements: 

A IS  GROUP ((*)); 

C IS  FIELD  (A, (50) ) ; 

X IS  INTERIM  ( (*,50) ) ; 

Al:  X ( I ) = C ( I ) ; 

In  this  example,  the  number  of  subscripts  soecified  for 
the  data  names  X and  C in  the  assertion  is  different  from 
the  actual  dimension  (obtained  from  the  corresponding  data 
description  statements).  Therefore,  the  assertion  Al  is 
changed  to: 


Al:  X ( * , I ) = C(*,I)  ; 

However,  if  the  data  description  for  the  interim  data  name 
X in  the  above  example  was: 

X IS  INTERIM; 

Then  the  procedure  GRNGDIM  changes  the  dd  statement  for  X 


X IS  INTERIM  ((*)')  ; 
and  the  assertion  is  changed  to: 
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Al:  X ( I ) = A ( * , I ) ; 

Later  on,  during  the  subscript  proDaqation  (see  the 
description  of  the  Procedure  SUBPROP),  the  data 
description  of  X will  aqain  be  changed  to: 

X IS  INTERIM  ((*,*)); 
and  the  assertion  will  oe  changed  to: 

Al:  X(*,I)  = C(*,I)  ; 

9. 1.3. 2 The  Procedure  IFENAMS  (Identify  "FOP-EACH"  Names) . 

If  a repeating  data  name  is  not  subscripted  in  an 
assertion,  then  the  data  name  in  the  assertion  is  changed 
to  include  the  default  subscript  name  ("for-each"  or  "*") 
as  described  in  the  earlier  sub-section.  Alternately,  the 
user  might  have  used  those  for-each  names  himself  as  in 
the  following  assertion: 

A ( * ) * B (*)  + C ( * ) ; 

The  function  of  the  procedure  IFENAMS  is  to  supply  an 
appropriate  subscript  name  for  each  "for-each"  name  in  an 
assertion.  If  a "for-each"  name  is  used  as  a subscript 
for  the  k th  dimension  of  a data  name  <dname>,  then  the 

subscript  is  changed  to  a subscript  name  usinq  the 

following  rules: 

(1).  If  a subscript  name  is  already  specified  for  the  k 

th  dimension  of  <dname>  (usinq  a subscript 
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statement),  then  that  subscript  name  is  used  as  the 
subscript . 

(2) .  If  no  subscript  name  is  specified  for  the  k th 

dimension  of  <dname>,  then  the  assertion  in  which 
the  <dname>  is  used  is  searched  for  a subscripted 
data  name  and  an  acceptable  subscript  name.  If  a 
subscript  name  is  found,  then  that  subscript  name  is 
used  as  the  subscript  for  the  k the  dimension  of 
<dname>. 

(3) .  If  the  search  in  steps  (1)  and  (2)  is  unsuccessful, 

then  a new  subscript  name  is  supplied.  Also,  the 
same  name  is  propagated  to  the  k th  dimension  of 
every  variable  used  in  the  assertion,  if  "fot-each" 
was  specified  for  that  dimension. 

For  example,  if  the  assertion  is  of  the  form: 

Al:  A ( * ) = B ( * ) + C ( * ) ; 

then,  the  assertion  is  changed  to: 

Al:  A ( FOREACH$A)  = B(FOREACH$A)  + C ( FOREACH$A) ; 
and  the  following  subscript  statement  is  generated: 
FOREACH$A  IS  SUBSCRIPT ( A) ; 


Note  that  the  parent  name  is  arbitrarily  chosen  among  the 
possible  candidates.  This  parent-daughter  association  may 
be  changed  after  further  analysis  in  procedure  CSUBLST, 
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which  is  described  in  the  next  section.  i 

If  another  assertion  A2  was  also,  specified: 

A2 : X(*)  = Y(*)  * C(*)  ; 

f 

The  for-each  names  in  this  assertion  i-s  chanqed  to 
FOREACH$A,  because,  the  for-each  name  associated  with  C 
was  changed  to  FOREACH$A  while  analyzing  the  assertion  Al. 

Therefore,  the  assertion  A2  is  changed  to: 

A2 : X ( FOREACH$A ) = Y ( FOREACH$A)  * C ( FOREACH$A ) ; 

As  another  example,  if  the  assertion  is  given  by: 

A3:  AA ( * , * ) = BB ( * ) + CC(*)  ; 

Then,  two  subscript  names  will  be  supplied,  thus  chanqing 
the  assertion  to: 

A3:  AA ( FOR EACH $AA1 , FOREACHS AA2 ) = BB (F0REACH$AA2 ) + 

CC (F0REACH$AA2) ; 

While  a subscript  is  supplied  to  a data  name,  the 
derivation  tree  of  the  cor respond inq  assertion  (in  which 
the  data  name  is  used)  must  also  be  updated.  This  is  done 
in  Procedure  (JPDASSN.  The  procedure  GSUBNPT  is  used  to 

i 

obtain  a new  subscript  name.  Whenever  a new  subscript 
statement  is  generated  for  a data  name,  then  that  data 
name  is  assumed  to  be  the  parent  of  the  subscript  name. 

However,  if  the  data  name  is  not  repeating,  then  the 
parent  structure  of  that  data  name  is  searched  for  the 
repeating  data  name.  For  example,  for  the  following  set 
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of  statements: 

A IS  GROUP (*) ; 

B IS  FIELD (A, (*) ) ; 

C (*,*)  =■  B ( * , * ) ; 

two  subscti.pt  statements  are  qenerated  for  the  two 
subscripts,  and  the  assertion  is  changed  to: 

C (FOREACH$A, FOREACHSB)  = B ( FOREACH$A , FOREACHSB ) ; 
Also,  the  data  names  A and  0 are  assumed  to  be  the  parents 
of  the  subscript  names  FOREACH$A  and  FOREACHSB 
respectively.  The  procedure  GEXACTD  is  used  for  obtaininq 
the  repeating  data  name. 

Also,  whenever  a subscript  statement  is  qenerated,  the 
procedure  SSUBRNG  is  called  to  set  the  ranqe  for  the 
subscript  from  the  corresponding  ranqe  in  the  data 
description  statement  of  its  patent  repeating  data  name. 

9 . 1 . 3 . 3 The  Procedure  CSUBLST  ( Create  Subscript  List ) . 

This  procedure  creates  a list  of  all  data  names  that 
use  a particular  subscript  name.  Also,  from  this  list,  it 
obtains  a parent  name  for  the  subscript  name  using  the 
followinq  process: 

(1).  Search  for  the  data  name  in  the  list  that  has  the 
largest  dimension.  If  more  than  one  data  name  with 
the  largest  dimension  exists,  then  search  for  an 
arbitrary  source  data  name  among  them. 
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(2).  If  a patent  name  is  not  associated  with  the 
subscript,  or  if  the  specified  patent  name  has  a 
dimension  less  than  that  of  the  data  name  obtained 
in  step  (1),  then  the  patent  name  is  changed  to  the 
data  name  obtained  in  step  (1). 

As  an  example,  if  the  data  name  list  for  a subscript 
name  I is  given  by: 

A ( I ) , B ( I ) , X ( I ) . 

and  X ( I ) and  B(I)  are  source  data  names,  and  I does  not 
have  any  parent,  then  either  B or  X is  supplied  as  the 
patent  name  for  I. 

As  another  example,  if  the  data  name  list  for  a 
subscript  name  I is  given  by: 

A(I,J)  , 3(1)  , C (I) . 

Then,  A is  assumed  to  be  the  parent  of  I even  if  A is  a 
target  variable. 

9. 1.3.4  The  Procedure  CSUBRED  (Check  Sub.  Redundency) . 

This  procedure  is  used  to  check  redundancies  and 
inconsistencies  in  subscript  specification.  The  check  is 
performed  on  the  list  of  data  names  associated  with  a 
subscript  name  (which  is  obtained  using  the  procedure 
CSU8LST  described  in  the  earlier  sub-section).  Subscript 
inconsistency  is  due  to  the  use  of  same  subscriot  name  for 
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two  different  souice  data  names  or  two  different  tarqet 
data  names.  For  example,  consider  the  followinq  set  of 
statements : 

G IS  GROUP; 

A IS  FIELO(G, 100)  ; 

B IS  FIELD(G, 100)  ; 

C IS  FIELD(X, 100)  ; 

C ( I ) = A ( I ) + B ( I ) ; 

In  this  example,  the  subscript  name  I is  used  for  the  data 
names  A and  B,  which  are  not  related.  This  type  of 
inconsistency,  in  many  cases  may  be  acceptable,  and 
therefore,  is  not  treated  as  error,  but  a warninq  is  sent 
to  the  user  for  each  such  inconsistency. 

Subscript  redundancy  is  due  to  the  use  of  different 
subscript  names  for  the  same  data  name.  For  example,  if 
the  list  of  data  names  associated  with  two  subscript  names 
I and  J are  qiven  by: 

A ( I ) , B ( I ) , X ( I ) ; 
and  C(J) , B ( J) , Y (J) . 

then  the  data  name  B is  subscripted  by  both  I and  J. 
Therefore,  one  of  the  names  can  be  replaced  by  the  other. 
If  J is  changed  to  I,  the  subscript  statement  for  J is 
deleted  and  the  list  of  data  names  associated  with  the 
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data  name  I is  changed  to: 

A(I) , B ( I ) , X ( I ) , C ( I ) , Y (I)  . 

However,  there  is  one  exception  to  the  above  rule.  If  two 
subscript  names  I and  J ate  used  as  subscriots  for  a data 
name,  but  the  parents  of  I and  J are  the  same  name  (I  and 
J being  associated  with  the  different  dimensions  of  that 
data  name),  then  there  is  no  redundancy.  For  example,  if 
the  data  name  lists  associated  with  I and  J are  given  by: 
A ( I ) , B ( I , J ) ; 
and  A ( J) , B ( I , J) . 

respectively,  and  if  I is  associated  with  the  first 
dimension  of  B and  J is  associated  with  the  second 
dimension  of  B,  then  identifying  the  subscript  name  I with 
the  name  J will  violate  the  second  criteria  used  in  the 
subscript  consistency  check  of  procedure  ASU8ST. 
Therefore,  in  this  case,  I and  J are  treated  as  two 
different  subscripts.  This  type  of  situation  can  arise  in 
cases  like  the  following: 

B(I,J)  « X ( I ) * Y ( J ) ; 

In  this  case,  matrix  B is  assumed  to  be  the  outer  product 
of  the  vector  A. 

If  two  subscript  names  are  found  to  be  equivalent  (that 
is,  one  of  them  is  redundant),  then  one  of  the  names  is 
"deleted"  from  the  associative  memory.  (Actually,  the 
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ledundant  subscript  name  is  not  deleted  physically.  But, 
a linked  list  of  all  equivalent  subscript  names  is 
maintained  in  the  associative  memory.  Also,  an  error  code 
is  moved  to  the  data  area  of  the  storaqe  entry  of  every 
member  in  the  equivalent  class,  except  one,  to  indicate 
that  that  statement  is  a deleted  statement.) 

9. 1.3, 5 The  Procedure  SUPFORE  (Supply  For-each  Exons . ) . 

This  procedure  supplies  for  each  expressions  for  every 
assertion  in  which  a subscript  data  name  is  used,  but  no 
range  for  that  subscript  is  specified.  For  example, 
consider  the  following  two  statements: 

I IS  SUBSCRIPT (A,  (1,100,1) ) ; 

A ( I ) = B ( I ) + C ( I ) ; 

In  this  example,  the  assertion  does  not  contain  any  ranqe 
for  the  subscript  name  I.  Therefore,  the. range  for  I is 
obtained  from  the  subscript  statement  for  I,  and  the 
assertion  is  changed  to: 

FOR  I = 1 TO  100  BY  1 A(I)  = B(I)  + C(I); 

It  uses  the  procedure  MERGSUB  described  in  section  8. 2. 3. 3 
to  supply  the  for-each  expressions. 

9.1.4  Some  Additional  Processing. 


Once  the  completeness  is  complete,  two  additional 
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functions  ate  performed  by  the  toutine  RESIC.  These  ate: 

(1) .  Establishes  type-9  telationships  between  the  sister 

nodes  in  the  ttee  structure  cottespondinq  to  the 
input,  output,  ot  interim  data  names.  This  function 
is  petfotmed  by  the  routine  SETT9. 

(2) .’  Obtains  the  number  of  different  representations  for 

a name.  This  information  is  stored  in  IOFENTRIES  of 
the  structure  "DATA-LIST"  (see  section  6. 1.4.1). 

This  entry  is  used  in  the  documentation  phase  to 
supply  minimum  number  of  aual if ications  for  any  data 
name.  The  Procedure  SET#OFE  is  used  to  compute 
tOFENTRIES . 
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9.2  Array  Graph  Representation. 

In  this  section,  we  will  define  the  concept  of  an 
"array  qraph",  which  is  a qeneralized  representation  of  a 
directed  qraph.  In  general,  an  array  qraph  is  a compact 
representation  of  a conventional  directed  qranh  in  which 

each  node  represents  a variable  name  or  an  element  of  a ^ 

variable  name.  We  are  interested  in  performing  the  qraph 
algorithms  directly  on  the  array  qraph.  In  particular,  we 
are  interested  in  obtaining  a "schedule"*  for  the  array 
qraph.  We  will  call  the  array  graph  seguenceable  if  the 
underlying  graph  (which  is  eauivalent  to  the  otiqinal 
graph,  and  in  which  each  node  represents  a sinqle  element) 
is  acyclic.  We  will  see  that  not  all  array  graphs  are 


seguenceable;  however,  we  can  decompose  a set  of  nodes  of 


some  of  those  array 

graphs,  and  obtain  a 

schedule  for 

the 

decomposed  graphs. 

Using  appropriate 

definition 

of 

. L 

decomposabi 1 ity , we 

will  also  derive  a 

necessary 

and 

sufficient  condition 

for  sequenceability. 

We  will 

also 

* Informally,  schedule  is  a sequence  of  two  control 
elements  (called,  do-element  and  end-element),  and  the 
compound  nodes  in  the  qraoh.  The  nodes  in  the  sequence 
are  topologically  sorted.  If  a set  of  nodes  are  to  be 
iterated,  then  that  set  will  be  preceded  by  a 
do-element,  and  succeded  bv  an  end-element  in  the 
schedule.  A formal  definition  of  schedule  is  given  in 
section  9.2.3. 
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call  a graph,  "schedulable"  if  we  can  obtain  a schedule 
for  that  graph.  We  will  show  that  not  all  seguenceable 
graphs  are  schedulable,  however,  we  can  decompose  a 
sequenceable  graph  to  obtain  a schedulable  graph.  We  will 
describe  our  graph  analysis  after  introducing  some 
definitions.  An  overview  of  the  results  on  the  array 
graph  is  shown  in  Figure  9.4. 

To  make  our  graph  analysis,  we  will  use  a simpler 
representation  of  our  directed  qtaph.  This  simpler 
representation  consists  of  only  two  types  of  edges:  (1)  01 
edge-connecting  a source  variable  node  and  a target 
variable  node  (this  is  represented  by  a type-3  edge,  an 
assertion  node,  and  a type-4  edge  in  the  actual 
representation  described  in  earlier  chapters)  and  (2)  02 
edge  - connecting  a target  variable  node  and  a source 
variable  node  (this  edge  is  the  same  as  the  tyoe-8  edge 
described  earlier). 
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9.2.1  A it a y Graph . 

An  array  qtaph  G is  a pair  (N,E),  where  N is  a set  of 
compound  nodes  and  E is  a set  of  compound  edqes.  Each 
compound  node  has  the  form: 

A ( 1 , u , c ) 

where  A is  the  node  (variable,  or  array)  name,  and  1,  u 
and  c define  the  array  ranqe.  Thus  this  node  represents 
the  set  of  elements:  A ( 1 ) , A(l+c),  ...  A(l+k.c)  , where 
k 

A compound  edge  will  have  the  form: 

A(a.I+b)  <-  B(c.I+d)  for  1 <=  I <=  u 

where  a,  b,  c,  d,  1 and  u are  integer  constants. 

for  a given  array  graph  G = (N,E),  we  can  define  the  r 

underlying  graph  G * (N  ,E  ) which  is  a conventional 

u u u 

■i 

graph  obtained  by  considering  each  of  the  array  components 
as  individual  nodes.  Thus, 

N = (A(l  + I.c)  I 0 <-  I <-  ^(u-n/cj  and  A(l,u,c)€  N} 
u 

I 

and 

E * {A (a. I+b)  <-  B(c.I+d)  I 1 <-  I <>  u 
u 

and  (A(a.I+b)  <-B(c.I+d)  , lOI<-u)  C E) } . 


1_ SlLj. 
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We  will  define  an  art  ay  qraph  G to  be  acyclic  if  its 

underlying  graph  G is  acyclic. 

u 

In  the  above  definition,  we  restricted  variable  names 

to  be  of  dimension  1.  In  a general  case,  a compound  node 

will  be  represented  by: 

111  222  nnn 

A ( 1 ,u  ,c  )(1  , u ,c  ) ...  (1  ,u  ,c  ) 

which  represents  the  set  of  variables: 

111222  nnn 

{ A ( 1 +k  .c  ,1  +k  .c  , ...  , 1 +k  .c  ) I 

1 111 

0 < = k < = [(u  - 1 )/c  J , 

2 2 2 2 

0 <=  k <=  [(u  - 1 )/c  J , 


n nnn 
and  0 <=  k <=  ^(u  - 1 )/c  J } 

Where  n is  the  dimension  of  the  variable.  The 
representation  of  a compound  edge  can  also  be  extended  in 
a similar  manner.  In  the  following,  only  arrays  of 
dimension  1 are  considered.  The  analysis  of  general  array 
graphs  in  which  the  nodes  represent  arrays  of  dimension  > 
lf  is  quite  involved,  and  therefore,  the  general  case  will 
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not  be  discussed.  However,  some  results  on  the  analysis 
of  array  graphs  of  dimension  1 can  be  extended  to  the 
general  case,  and  this  extension  will  be  described  in 
section  9.2.16. 

As  mentioned  earlier,  we  are  interested  in  sequencing 
the  array  graph,  and  we  can  see  that  an  array  graph  G is 
sequenceable  if  the  corresponding  underlying  graph  is 
acyclic. 


In  general,  it  is  difficult  to  say  if  an  array  graDh  is 
acyclic  from  its  initial  representation . This  is  because, 
the  different  nodes  representing  the  same  data  name  may 
have  different  (intersecting  or  non-intersecting)  ranges. 
To  overcome  this  problem,  we  will  introduce  the  concept  of 
an  "induced  graph"  which  is  defined  below. 

9.2.2  Induced  Array  Graph. 


For  a given  array  graph  G = (N , E) , we  construct  a 

bipartite  induced  graoh  G * (N  ,E  ) as  follows:  The  set 

i i i 

of  nodes  N * LUR  is  a union  of  two  subsets,  left  nodes 
i 

(L  nodes)  and  Right  nodes  (R  nodes).  The  set  of  directed 
E,  is  also  partitioned  into  (^j,  a set  of  edges 

going  from  R nodes  into  L nodes;  and  (^) , a set  of  edges 


► 


* 


a set  of  edges 
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going  from  L nodes  into  R nodes.  Ror  each  edge  in  the 
original  graph: 

A(a.I+b)  <-  B(c.I+d)  1 <=  I <=  u 

We  add  to  the  set  L,  the  node  A (a.l+b,a.u+b,a) , and  to  R, 
the  node  B (c . 1+d ,c . u+d , c ) , and  draw  a 01  edge  leading 
from  the  lattei  to  the  former. 

After  the  construction  of  L,  R 5 ^ is  thus  complete, 

we  construct  ^>^by  drawing  02  edge  between  any  lfiL 
and  r£R,  which  have  a non-empty  intersection  (that  is, 
have  identical  variable  names  and  non-emDty  range 
intersection ) . 

Consider  for  example,  the  array  graph: 

A ( 1 ) <-  C 

A ( I ) <-  A(I-l)  For  2<=I <=4 

Its  induced  graph  is  given  in  Figure  9.5  (denoting  L nodes 
by  Squares  and  R nodes  by  Circles). 

We  do  impose  another  restriction  on  our  induced  graphs 
and  that  is,  no  two  L nodes  have  a non-empty  intersection. 
This  implies  that  in  the  original  specification,  no  two 
compound  edges  refer  to  a common  underlying  element.  If 
this  is  not  satisfied  originally,  we  can  always  form 
intersections  of  compound  edges  and  break  each  tanqe  into 
several  sub  ranges  until  this  requirement  is  met. 


C(l,  1,  1) 
(1) 


Figure  9.5.  Induced  Graph  for  the  Assertions 
A(  1)  - c ; 

A(I)  - A(I-l)  For  2 4 1$  4 
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9.2.3  Schedule  for  an  Induced  Gtaph . 

Let  S be  a sequence  of  nodes  that  represents  the 
topological  sorting  of  the  nodes  in  the  underlying  graph 
corresponding  to  an  array  graph  G.  If  any  two  adjacent 
nodes  in  S represent  two  different  elements  of  a variable 
name,  then  we  will  replace  the  two  nodes  in  S by  a 
sequence  of  (1)  a do-element,  (2)  a compound 
representation  of  the  two  nodes,  and  (3)  an  end-element. 

The  do-element  is  of  the  form: 

DO  I = 1 TO  u BY  c; 

The  end-element  is  just  the  keyword:  END ; 
and  the  compound  node  is  of  the  form:  A (I). 

Thus,  if  a do-element  preceeds  a compound  node,  then 
the  range  in  the  do-element  specifies  the  ranqe  of 
elements  that  the  compound  node  represents.  The 
end-element  is  just  used  to  terminate  the  effect  of  the 
do-element  immediately  preceeding  it.  For  example,  if  we 
have  the  following  sequence  of  elements  in  S: 

8(1)  ; A(l)  f A(3)J„B<3)  ; ... 
then,  the  sequence  can  be  replaced  by: 

B ( 1)  ; 

DO  I = 1 TO  3 BY  2; 

A ( I ) ; 

END; 

B ( 3)  ; 
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We  can  apply  similar  reduction  process  to  mote  than  two 
adjacent  elements  of  a same  name  in  the  sequence,  or  to  a 
sequence  of  elements  of  two  different  names,  the  elements 
of  which  occur  alternately  in  the  sequence.  For  examde, 
the  following  sequence  of  nodes: 

C ( 1 ) ; A ( 1 ) ; B ( 1 ) ; A ( 2) ; B(2);  C(2);  ... 
can  be  replaced  by: 

C ( 1)  ; 

DO  I * 1 TO  2 BY  1 f 
A ( I ) ; 

B ( I ) ; 

END; 

C ( 2)  ; . 

We  will  call  such  a reduced  sequence  consisting  of 
do-elements,  end-elements,  compound-nodes,  and  the 
otiqinal  nodes  in  S (that  were  not  teDlaced  by  any 
comoound  node)  as  a schedule.  The  formal  definition  of 
schedule  is  qiven  later  ir.  this  section. 

Actually,  we  are  not  interested  in  obtaining  a schedule 
from  the  underlying  graph  of  the  array  graoh  G.  We  are 
interested  in  obtaining  a schedule  for  an  array  graoh  (or 
its  induced  graph)  directly  from  the  array  graph,  or  at 
worst,  from  an  equivalent  decomposed  version  of  the  array 
graph.  In  the  definition  of  the  schedule  that  follows,  we 
will  use  a modified  form  of  a do-element  which  is  given 
by: 

IF  p THEN  DO  I * 1 TO  u BY  c; 
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where  p is  a boolean  exoression.  This  boolean  expression 
is  used  to  restrict  the  range  of  the  succeeding  compound 
node  from  that  of  preceedina  one.  For  example,  if  S 
contains  the  following  seouence  of  nodes: 

Ml)  J B ( 1 ) ; C ( 1 ) ; A ( 2 ) ; 9(2);  C(2);  A(3);  3(3);  D(l); 

• • • 

Then  the  above  sequence  can  be  reolaced  by  the  schedule: 

DO  I = 1 TO  3 3Y  1; 

Mi); 
b ( I ) ; 

IF  I < 3 THEN  DO  J = I; 

C(J)  ; 

END; 

END; 

In  the  above  example,  the  range  for  J is  restricted  by  the 
condition  in  the  if-clause  of  the  do-element  preceedinq 
C(J)  . 

A schedule  for  an  induced  graph  G can  now  be  defined 

i 

as  follows: 

(1) .  A(d)  for  any  integer  constant  d,  such  that 

A(d,d,l)  6 L,  is  a schedule  element, 

(2) .  A(c.I+d)  for  a variable  I and  constants  c and  d such 

that  A (c.l+d,c.u+d,c)  £ L,  is  an  I-element. 

(3) .  If  L is  a list  of  I-elements  and  schedule-elements 

and  p is  a boolean  exotession,  then 
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IF  p THE*!  DO  I = 1 TO  u BY  c; 

L; 

END; 

is  a schedule  element. 

A schedule  is  then  a list  of  schedule  elements  such 
that  i.  we  unfold  all  loops  (letting  I tanqe  between  1 and 
u in  increments  of  c) , we  obtain  a toooloqical  sortinq  of 
the  undetlyinq  qtaph  of.  the  act  ay  qtaoh  G.  This  implies 
in  particular  that  all  instances  of  nodes  appear  in  the 
schedule,  that  no  instance  appears  twice,  and  that  any 
instance  of  an  edqe  is  represented  in  the  order inq. 

We  will  now  define  an  induced  qraDh  to  be  schedu lable 
if  we  can  obtain  a schedule  for  that  citaph. 

9.2.4  The  Main  Problem. 

Now,  we  are  ready  to  Dose  our  main  Dtoblem,  that  is:  Is 
it  oossible  to  decide  the  schedulability  of  a qiven  array 
graph?  If  not,  can  we  prove  that  the  qraph  is  not 
schedulable? 

Note  that,  if  the  underlying  graph  corresponding  to  an 
array  graph  is  acyclic,  then  that  array  qraph  is 
schedulable.  Since  we  are  not  interested  in  decomposing 
the  array  graoh  into  its  equivalent  underlying  qraph,  we 
will  define  an  induced  graph  to  be  seguenceable  if  the 
corresponding  undetlyinq  graph  is  acyclic.  We  will  try  to 
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obtain  a 

necessary 

and  sufficient 

condition 

for 

sequenceability  of  an 

induced  gtaoh 

(and  not 

the 

under  lying 

graph ) , and 

then  obtain  a 

condition 

fot 

schedulability. 

To  expose  the  complexity  involved  in  scheduling  an 
array  qtaph,  we  will  now  select  four  examoles  of  vatyinq 
complexity. 

Example  1 . 

If  the  induced  qtaph  is  acyclic,  then  the  schedule  can 

be  obtained  easily  usinq  conventional  toooloqical  sotting 

techniques  and  by  interpreting  any  Ml,u,c)  £ L as  the 

schedule  element: 

00  I = 1 TO  u BY  c; 

A ( I ) ; 

END; 


Example  2 . 

If  the  induced  qtaph  contains  cycles,  and  these  cycles 
ate  due  to  the  02  edges,  then  some  of  these  cycles  can  be 
sequenced  usinq  a single  schedule  element  fot  each  cycle. 
Fot  example,  for  the  induced  graph  shown  in  Figure  9.5, 
the  schedule  is  given  by: 

A ( 1 ) ; 

DO  I * 2 TO  4 BY  1; 

A ( I ) ; 

END; 


rahntHHiV 
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Example  3. 

If  the  induced  graph  contains  cycles,  and  these  cycles 

ate  due  to  the  02  edges,  then  it  may  not  be  Dossible  to 

obtain  the  sequence  for  the  graph  using  a single  schedule 

element  for  each  cycle.  In  such  cases,  some  nodes  in  the 

cycle  may  have  to  be  split.  For  examole,  consider  the 

following  array  gtaoh: 

A ( I ) <-  A ( 1+2)  for  1<=  I <=  4 

A ( I ) <-  A(I-l)  + A ( I - 5 ) for  6<=  I <=  10 

A ( 5 ) <-  C 

The  induced  graph  for  the  above  array  graph  is  shown  in 

Fiqute  9.6.  It  is  not  possible  to  sequence  the  above 

graph  without  splitting  the  nodes  connected  by  the  first 

two  compound  edges.  One  possible  solittinq  decomposes  the 

above  graph  into  the  following: 

A < I ) <-  A ( 1 + 2 ) for  I = 1 TO  3 BY  2 

A ( I ) <-  A ( I +2)  for  I = 2 TO  4 3Y  2 

A ( 6 ) <-  A ( 5 ) + A ( 1 ) 

A ( I ) <-  A(I-l)  + A ( 1-5 ) for  I = 7 TO  10 
A ( 5 ) <-  C 
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Now,  this  aitay  graph  can  be  sequenced  as  in  Example  2, 
and  one  possible  sequence  is  qiven  by: 

A(5)  ; 

DO  I = 3 TO  1 BY  -2; 

A(I)  ; 

END; 

A ( 6 ) ; 

DO  I = 4 TO  2 3Y  -2; 

A ( I ) ; 

END; 

DO  I = 7 TO  10  BY  1; 

A ( I ) ; 

END; 


Example  4 . 


In  some  cases,  some  of  the  schedule  elements  have  to  be 

merged  so  as  to  obtain  a valid  sequence  for  the  artay 

graph.  Considet  foi  example,  the  following  aitay  giaoh: 

B ( I ) = A ( I ) FOR  1 < = I < = 8 

A ( 1+1 ) = B ( I ) + 1 FOR  1 <=  I <=  4 

A ( 2 . 1+4 ) = B (2. I ) +1  FOR  1 <=  I <=  2 

A ( 1+6 ) = B ( 3 ) +1  FOR  1 <=  I <=  4 

The  col  responding  Induced  qraph  is  shown  in  Figure  9.7. 

One  possible  schedule  foi  this  graph  is  qiven  by: 
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DO  II  * 1 TO  8 RY  1; 

R(ll)  ; 

IF  II  <=  4 THEM  DO  12  = II; 

A (12+1)  * P ( 1 2 ) + 1; 

EMD; 

IF  (II  >=  2 * II  <=  4 t,  MOD (11,2)  * 0 ) 

THEM  DO  13  = II; 

A ( I 3 + 4 ) = 3(13)  + 1; 

END; 

IF  II  = 3 THEN  DO  14  = II  + 4 TO  II  + 7; 

A ( 1 4 ) = 3(3)  + 1; 

F.ND; 

END; 

Note  that  the  hoolean  expressions  associated  with  the 
inner  do-loops  restrict  the  associated  iteration  to  a 
subrange  of  the  range  of  subscript  associated  with  the 
outer  most  loop. 

From  the  aoove  examples,  we  can  see  that  the 
conventional  topological  sorting  algorithm  cannot  be 
aoolied  directly  to  every  induced  graph.  Some  graphs  have 
to  be  decomposed,  and  in  such  cases,  we  would  like  to 
minimize  the  amount  of  decomposition,  because,  the  more 
compact  the  graph,  the  more  efficient  will  the  schedule 
be.  de  will  not  try  to  present  an  "efficient"  algorithm 
for  the  decomposition  of  "decomposable"  array  otaohs,  but 
will  derive  some  criterions  for  decomposition. 


9.2.5  Nap  Vector  and  Threshold. 


If  two  nodes  i and  i ar'e  connected  by  a 02  edge,  then 
we  will  associate  a Mao  vector  M and  a Threshold  T 

ii  i 1 
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with  that  edge  which  aLe  defined  as  follows: 

M (k)  = 1 + q if  theLe  exists  an  integer  q 

ii 

such  that  1 +k.c  * 1 +a.c 

i i 1 1 

for  0 <«  k <■  I (u  ■*  1 )/c  I 

i i i J 


and 


« 0, 


and  0 <=  q <* 


otherwise. 


T ■ MIN  ( M (k)  - k)  foi  every  k foi  which 

ij  i j 

'1  (k)  > 0. 

ii 


As  an  example,  for  the  induced  graoh  in  Figure  h.5, 

M ■ (1)  , 

23 

T =0, 

23 

M = (2,3,0), 

43 

and  T * MIN  ( 2- 1 , 3-2 ) = 1. 

43 

The  Threshold  and  the  Map  vector  are  useful  in  the 
analysis  of  cycles  that  contain  02  edges.  In  such  cycles. 


From  now  on,  we  will  represent  the  range  of  a numbered 
node  by  using  aporooriate  subscripts.  For  example,  i th 
node  will  be  represented  by: 

A(1  ,u  ,c  ) 
i i i 
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some  of  the  undeLlyim  elements  of  a L node  must  Dtecede 
some  others,  either  by  direct  or  indirect  imDlication. 
The  precedence  can  he  between  elements  of  lower  subscripts 
and  elements  of  hiohet  subscripts,  ol  vice  versa,  leadino 
to  an  ascendinq  or  descending  loon  respectively.  Thus,  in 
the  example  of  Fiqure  9.5,  A(2)  must  precede  A(3),  and 
A(3)  must  precede  A ( 4 ) . If  such  a otecedence  always 
holds,  then  we  can  say  that  the  looo  is  “consistent".  The 
Threshold  measures  the  distance  between  the  closest 
elements  which  should  be  related  by  Drecedence. 

9.2.6  '-lap  Vector  between  two  arbitrarv  nodes. 

If  two  nodes  i and  i are  connected  bv  a sequence  of  (31 
and/or  02  edoes,  then  we  will  associate  a man-vecotr  M 

ii 

between  the  two  nodes  which  is  defined  as  follows: 

(1) .  If  k is  a successor  of  node  i and  k is  in  the 

Path(i,i)  then,  if  i and  k are  connected  by  a 

02  edne,  then  the  map  vector  M is  defined  as 

ik 

in  the  previous  section.  Otherwise, 

d * (1,2,3,  ...  n), 
ik 

where,  n = 1 + ! (u  - 1 )/c  J 

i z.  i 

(2) .  If  k and  1 are  two  nodes  in  the  path(i,i) 
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such  that  oath(ifk)  < path ( i , 1 ) c <=  path(i,i),  then 

'1  is  defined  as  follows: 
il 

If  k is  connected  to  1 by  a 01  edqe,  then 

M = M ; 
il  ik 

otherwise , 

M (q)  = M (M  (q)  ) , if  m (q)  > 0 
il  klik  ik 

= 0,  otherwise. 

for  all  q in  the  tame:  1 <=  a <=  n. 

If  there  are  more  than  one  oaths  between  the  two  nodes, 
then  the  mao-vector  is  superscripted  with  the  seouence  of 
nodes  that  represents  the  appropriate  oath. 

As  an  example,  for  the  induced  qraoh  shown  in  Fiaure 
9.7, 

M * (1,2, 3, 4, 0,0, 0,0) 

23 


Path(i,j)  denotes  a path  from  vertex  i to  vertex  i and  is 
defined  as  the  sequence  of  edaes: 

(i,vl)  (vl ,v2)  ...  ( vn  , i ) 

or  by  the  sequence  of  nodes:  i vl  v2  ...  vn  i 

where,  vl,  v2,  ...»  vn  are  the  nodes  in  the  oath  from 
vertex  i to  vertex  ■}.  Also,  if  one  oath  is  contained  in 
another,  then  one  of  the  relational  operators  is  used  to 
indicate  such  relation.  Therefore,  in  the  above  example, 
path(i,v2)  < path(i,i). 
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M = 
25 

(0,1, 0,2, 0,0, 0,0) 

M = 

41 

(2, 3, 4, 5) 

M 

62 

( 6 , R ) 

2341 

M 

21 

= (2, 3, 4, 5, 0,0, 0,9) 

2561 

M 

21 

= (0,6, 0,8, 0,0, 0,0) 

• • 

and 

• 

so  on  • 

M determines  the  precedence  between  different  elements 

ij 

of  the  two  nodes.  It  also  indicates  that  if  the  q th 
element  of  M is  non  zero,  then  there  exists  a path  in 

ij 

the  undei lyinq  qraoh  from  the  node  reoresented  by  the 
q th  element  of  node  i,  that  is, 

A(1  ♦ c . (q-1) ) 


to  the  node  leptesented  by  the  M (a)  th  element  of  node 


j,  that  is, 

13(1  + c .(*  (a)  - 1)) 

1 j il 
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9.2.7  Self  Map  Vector. 


For  a node  i in  a cycle  of  an  induced  qraoh,  we 


define  the  self  map  vectot  0 as  eaual  to  M , where 


x is  the  seauencc  of  nodes  (oath(i,i))  in  the  cycle.  As 
in  the  case  of  map  vectot,  if  there  are  more  than  one 
cycles  throuqh  the  node  i,  then  we  will  distinauish 
different  self  man  vectors  by  usinq  appropriate 
superscripts. 

As  an  example,  for  the  induced  qraoh  shown  in  Fiqure 
9.7,  we  have: 

12341  2341 

0 = M =(2,3, 4, 5, 0,0, 0,0) 

1 21 

12561  2561 

0 = M = (0,0, 0,8,0, 0,0,0) 

1 21 

0 = (2, 3,4,0) 

3 


As  in  the  case  of  the  mao  vector,  elements  of  the  looo 

mao  vectot  Q determines  the  precedence  between  different 
i 

elements  of  node  i.  In  fact,  if  ? (q)  is  non-zero,  then 

i 

it  means  that  Q (q)  th  element  of  node  i depends  on  the 


q th  element  of  node  i.  Therefore,  for  the  cycle  to  be 
consistent,  the  followim  relation  must  be  true  for 


■ 
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evetv  node  i in  the  cycle: 

0 (o)  > u for  eveiy  inteoei  o for  which  0 (a)  > 0 

i i 

In  the  induced  qtaoh  of  Fiquie  9.7,  all  the  three  cycles 

satisfy  the  above  condition,  and  therefore  aie  consistent. 

9.2.8  Representation  of  ^ag  and  Sjelf  hap  Vectors . 

If  V lentesents  a set  of  elements  that  represents  a mao 
vectoi  or  a self  man  vectoi,  then  its  non-zero  elements 
ate  either  monotonical ly  increasinq  or  decreasinq. 
Therefore,  we  can  represent  such  a vector  usinq  two 
triolets  as  qiven  below: 

V=  fl  ,u  , c ] (1  ,u  , c ] 

x x x y y y 

wher  e- 

1 is  the  smallest  inteqei  foi  which  V(1  ) > 0, 

x x 

u is  the  lamest  inteqei  for  which  V(u  ) > 0, 
x x 

c * I (u  - 1 ) /nj  if  n * the  number  of  non-zero 

X *■  X X 

elements  in  v i3  qreatei  than  1; 

a 1,  otherwise, 

1 * V ( 1 ) , 

y x 

u * V ( 1 ) , 
y X 


330 


and  finally, 

c = I (u  - 1 ) /n  _[  if  n > 1 , 

y y y 

= 1,  otherwise. 

For  example,  the  vector  (4,0, 2,0)  will  he  represented 
by:  f 1 , 3 , 2 ) [ 4 , 2 , -2 1 ; wheteas,  the  vector  ( 0 , 0 , 1 , 2 , 3 , 4 , 0 ) 
will  be  represented  by:  (3,6,1)11,4,11. 

As  another  examole,  consider  the  induced  oranh  shown  in 
Fiqute  9.7.1.  The  man  vectors  associated  with  the  02 
edqes  and  the  self  map  vector  associated  with  node  (1)  of 
the  qraoh  in  this  fiqute  is  shown  in  Fiqure  9.7.2.  These 
vectors  in  the  6-tuole  notation  (described  above)  are 
qiven  by: 

M = (1,4,31  (4, 2, -21 
23 

M = (1,4,1)  (2,3,11 
41 

and  0 = (1,4,31(5,3,-21 

1 

With  the  above  definitions,  we  can  now  start  out 
analysis  of  induced  qraph. 


B(  1 » 5 , 1) 

Cl) 


A(  10 , 2 , - 2) 
(2) 


A(l,10,3) 

(3) 


B ( 2 , 5 , 1) 
(4) 


Figure  9.7.1.  A Simple  Induced  Graph. 


A( 10 , 2 , - 2) 

(2)  . 


(5,  0,0, 3,0) 

(4,  0,0, 2,0) 
- (10,8,6,4,2) 


m234 1 “Ql 

Mo  i 


A(1 , 10 ,3)  - (1,4,7,10) 

j ! 

j (2, 3, 4, 5)  = M41 

b(2 ,5 ,1)  - (2, 3, 4, 5) 

(4)  \ \ V \ 


\\\\ 


UB(1,5,1)  - (1,2, 3, 4, 5) 

(1) 

Figure  9.7.2.  Map  and  Self  Map  vectors  for  the  Graph 
in  Figure  9.7.1. 
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9.2.9  Consistency  of  Cycles  in  an  Induced  Graph.  i 

f As  discussed  in  the  preceedinq  section,  if  for  every 

node  i € R in  a cycle  of  an  induced  qtaph,  the  relation 

0 (k)  > k is  satisfied  for  all  k for  which  0 (k)  > 0, 
i i 

then  there  exists  orecedence  relations  between  elements  of 

lower  subscripts  and  elements  of  hiqher  subscripts,  or 

vice  versa.  Therefore,  such  a cycle  can  be  sequenced;  and 

so,  the  followinq  theorem  is  proven: 

Theorem  9.1.  A cycle  in  an  induced  qtaph  is  consistent 

(or  sequenceable)  if  for  every  node  i ( R in  the 

cycle,  Q (k)  > k for  all  k for  which  0 (k)  > 0. 
i i 

We  can  also  obtain  a stronqer  condition  for  consistency 

usinq  Threshold  rather  than  self  map  vector.  This 

condition  is  given  in  the  followinq  Theorem: 

Theorem  9.2.  A cycle  in  an  induced  qtaph  is  consistent 

(or  sequenceable)  if  the  sum  of  thresholds  on  all  its 
(92  edges  is  greater  than  zero. 

Proof : Consider  two  nodes  i and  j in  a cycle  that  are 

connected  by  a (92  edqe.  Now, 

M (k)  - k + M (k)  -k  for  l<-k<-  I (u  -1  )/c  I 
ij  ij  L i i iJ 

And,  by  using  the  definition  of  Threshold, 


I 
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M (k)  >=  k f T 

ii  ii 

(K)  - k > = T 

ii  ii 

de  can  extend  the  same  teasoninq  foL  two  nodes  i and  i 
connected  by  a sequence  of  01  and  02  edoes,  and  will 
qet : 

M (k)  - k >=  T 

ii 

where  T is  the  sun  of  thresholds  on  all  02  edqes  in  the 

path(i,i).  Therefore,  we  qet: 

n (k)  - k >=  T 
ii 

oi 

0 ( k ) - k >=  T 
i 

Therefore,  if  T is  qreater  than  ^eio,  then  ° (k)  - k > 0, 

i 

and  usinq  theorem  9.1,  the  cycle  is  consistent. 

'JED. 

Note  that,  the  conditions  in  theorems  9.1  and  9.2  are 
equivalent  if  the  cycle  contains  only  one  02  edqe. 
However,  the  condition  for  consistency  in  theorem  9.1  is 
weaker  than  the  condition  in  theorem  9.2  for  a qeneral 
case.  In  other  words,  there  exists  cycles  which  are 
consistent  with  resoect  to  theorem  9.1,  but  tne  condition 


334 


I 

\ 

l 

I 

t 

r 


in  theorem  9.2  does  not  hold.  For  example,  for  the 

following  array  graph, 

A ( 1+9)  <-  B ( I ) 1 <*  I <*  12 

3(1-3)  <-  A ( I ) 8 <=*  I < = 20  BY  2 

the  induced  graph  is  shown  in  Fiqure  9.8.  For  this  qraph, 

M * (2, 0,3, 0,4, 0,5, 0,6, 0,7,0) 

23 

T » MIN( 2-1, 7-11)  = -4 
23 

M = (5,7,9,11,0,0,0) 

41 

T - MIN ( 5-1 , 11-4 ) = 4 
41 

Q = (7,0,9,0,11,0,0,0,0,0,0,0) 

1 

Therefore,  Q (k)  > 0 for  all  k for  which  Q (k)  > 0; 

1 1 

and  so,  the  cycle  is  consistent  from  theorem  9.1. 

However,  the  consistency  condition  in  theorem  9.2  is 

not  satisfied,  because,  the  sum  of  thresholds  associated 

with  all  the  02  edges  in  the  cycle  is  zero. 


I 

['• 


Figure  9.8.  Induced  Graph  for  the  Assertions: 

A(I+9)  - B(I) ; For  1 gl  412 

B(I-3-)  - A(I)  ; For  I - 8 TO  20  BY  2 
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Though  the  condition  foe  consistency  qiven  in  theorem 

9.1  is  weaker  than  that  of  theorem  9.2,  it  is  easier  to 
implement  the  latter  than  the  former.  Also,  for  the 
majority  of  cases,  the  consistency  condition  of  theorem 

9.2  holds.  The  complexity  involved  in  comouting  a self 
map  vector  will  be  exDlained  in  later  sections.  In  the 

discussions  that  follow,  we  will  use  the  (weaker)  \ 

condition  of  theorem  9.1. 

The  condition  in  Theorem  9.2  is  useful  for  another  j 

reason.  Usinq  this  condition,  we  will  be  able  to  derive  a 
condition  for  schedulabil ity.  This  will  be  described  in 
the  later  sections  of  this  chapter. 

Note  that  the  consistency  condition  (of  Theorem  9.1)  is 
not  a necessary  condition  for  the  sequenceabil ity  of  a 

4 

cycle  in  an  induced  graph.  As  an  example,  consider  the  « 

induced  graph  shown  in  Figure  9.6.  For  this  graph, 

M * (0,0, 1,2) 

4 3 1 

343 

Q * (0,0, 1,2) 

3 

1 

Therefore  the  cycle  is  inconsistent  according  to  theorem 
9.1.  But  the  cycle  (or  the  graoh)  is  sequenceable  as  was 
shown  in  section  9.2.4.  Let  us  now  try  to  analyze  the 


array  graphs,  which  consist  of  inconsistent  cycles,  and 
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obtain  a necessary  and  sufficient  condition  for 
sequenceabil ity  after  decomposinq  some  nodes  in  those 
qraphs  (if  necessary). 

9.2.10  Decomposition  of  nodes  in  the  Array  Graphs. 

If  a cycle  in  an  induced  qraph  is  inconsistent,  then 
one  way  to  test  the  sequenceability  of  the  cycle  is  to 
test  the  sequenceability  of  the  underlyinq  qraph 
cor respondinq  to  the  cycle.  Such  a test  is  laborious,  and 
is  not  interesting.  However,  it  is  possible  to  decompose 
the  cycle  and  test  its  sequenceability  using  a more 
efficient  algorithm.  In  the  following,  two  criterions 
used  to  decompose  inconsistent  cycles  in  an  induced  qraph 
are  described. 

9.2.10.1  Decomposition  Criterion  ( 1 ) . 

If  a node  i€R  is  contained  in  a cycle  and  D denotes 

i 

the  set  of  elements  reoresented  by  the  node,  that  is, 

D * (1  ,1  +c  ,1  +2.c  , ...,  1 +n.c  ) 
i i i i i i i i 

where  n ■ I (u  - 1 )/c  J 
L i i i 

and  if  S denotes  the  set 
i 

S » (D  f\  (D  UD  U ...  U D )) 
i i ol  o2  on 
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where  pi,  p2,  . ..,  on  ate  the  02  predecessors  of  node 

i,  that  are  not  contained  in  any  of  the  cycles  through  the 

node  i,  then  the  set  S must  contain  the  leading  elements 

i 

of  the  set  D ; otherwise,  the  node  i (and  all  the  related 
i 

nodes)  must  be  decomposed  till  the  above  condition  is  met. 

Loosely  speakinq,  the  above  criterion  restricts  every 

source  variable  node  of  an  assertion  to  contain  all  the 

"computable"  elements  (elements  that  are  obtained  from 

outside  the  cycle,  or  the  "external  stimuli"  to  the  cycle) 

to  be  the  elements  with  lowest  or  highest  subscripts 

according  as  the  increment  associated  with  the  subscript 

is  positive  or  negative  respectively. 

If  the  set  S represents  the  trailing  elements  of  the 
i 

set  D (rather  than  the  leadinq  elements) , then  no 
i 

decomposition  is  necessary,  but  the  increment  associated 

with  the  node  i (and  all  the  related  nodes)  must  be 

reversed.  This  reversal  rotates  the  elements  of  the 

set  D , and  therefore,  makes  the  set  S to  represent  the 
i i 

leading  elements  of  set  D . 

i 


H 11(1  Hi.  IM.M.I  T n HI 

v fcjf*  ,.*v  A 

- — - - * i . ..i.  • 

: - -- 
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The  crudest  way  to  define  the  decomposition  mentioned 
in  the  above  criterion  is  to  SDlit  the  node  i (and 
accotdinqly  all  the  related  nodes)  into  two  nodes  that 
represent  the  followinq  sets  of  elements: 

1 

D = S 

1 i 

2 

and  D * D - S 

i i i 

If  any  of  the  above  two  sets  cannot  be  represented  by  a 
sinqle  node  (that  is,  if  they  are  not  monotonically 
increasinq  or  decreasinq  sets  of  numbers) , then  those  sets 
have  to  be  further  decomposed  into  smaller  subsets  till 
each  subset  can  be  represented  by  a sinqle  node.  This 
crude  way  of  decomposition  will  sometimes,  split  some  of 
the  nodes  into  too  many  nodes,  and  so,  some  refinement 
must  be  used.  Such  refinements  will  be  described  later. 

We  will  now  apply  the  above  decomposition  criterion  to 
the  induced  qtaoh  shown  in  Fiqute  9.6.  The  node  (3)  in 
the  graph  can  be  decomposed,  and  one  possible 

decomposition  is  given  by: 

A(5,4,-2) 
and  A(5,3,-2) 

The  decomposed  graph  is  shown  in  Figure  9.9.  All  the 


' if  ’ t 
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cycles  in  this  qraph  are  consistent  w.r.t.  the  theorem 

9.1. 

The  decomposition  criterion  described  above  is  not 

sufficient  to  obtain  a necessary  and  sufficient  condition 

for  sequenceability.  For  example,  consider  the  induced 

graph  shown  in  Figure  9.10.  This  graoh  is  indecomposable 

using  criterion  (1).  Also,  for  this  graph, 

M = (1,0, 2,0) 

34 


M = (4,2) 
52 


and  therefore, 

Q =(4,0, 2,0) 
2 


and  so,  the  cycle  is  inconsistent.  But  the  induced  graph 

is  schedulable,  and  one  possible  schedule  for  the  graph  is 

given  by:  J 

f i 

A ( 6 ) ; 

DO  I = 6 TO  8 BY  2; 

'MD  ; 

A (15-1); 

IF  I = 6 THEN  DO;  t 

A (8) ; . ; 

END; 

END; 

DO  I = 7 TO  9 BY  2; 

3(1)  ; 

END; 


(7) 


Figure  9.10.  Induced  Graph  for  the  Assertions: 
A(6)  = 10; 

B (I)  = A(I)  ; For  6(I«9 

A(8)  = B(6) ; 

A(15-2. 1)  = B C 2 . 1) ; For  3^154 
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In  the  above  schedule,  the  nodes  (2)  and  (3)  are 
split  into  two  sets  of  nodes  A(6,8,2),  A(7,9,2)  and 
B(6,8,2),  A(7,9,2)  respectively. 

Theorem  9.3.  If  the  self  map  vector  associated  with 

a node  i £ R in  a cycle  of  an  induced  qraph  is  given  by: 

0 = [1  ,u  ,c  1 [1  ,u  ,c  ] 

i xxx  y y y 

where  both  1 ' and  c are  1;  and  if  the  cycle  is 
x x 

indecomposable  w.r.t.  the  criterion  (1),  then  the  cycle  is 

sequenceable  iff  for  every  node  i €.  R in  the  cycle, 

Q (q)  > q for  all  q for  which  0 ( q ) > 0. 
i i 

Proof : 

If  both  1 and  c are  1 then  it  means  that  0 (P)  > 0, 
xx  i 

for  all  p in  the  range  0 < p <=  u . Also,  if  there 

x 

exists  an  inteqer  q for  which  q >=  0 (q)  > 0,  then 

i 

either,  (1)  1 =1, 

y 

or,  (2)  c <0. 

y 

In  the  first  case,  the  cycle  is  obviously  inconsistent 
and  not  sequenceable,  because,  the  first  element  of  the 
node  is  its  own  predecessor,  so  the  underlying  graph 


1 


| 

1 


I 
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itself  is  not  acyclic. 

i In  the  second  case,  the  ptecedence  is  between  the 

elements  of  higher  index  to  that  of  lower  index  (or  vice 
versa  if  increment  is  < 0) , even  though  the  leading 

elements  of  the  node  were  computable.  Even  in  this  case, 
the  cycle  cannot  be  decomoosed  to  a set  of  consistent 
cycles.  In  other  words,  at  least  one  element  of  the  node 
is  its  own  predecessor,  directly  or  indirectly,  implying 
that  the  underlying  graDh  itself  is  not  acyclic. 

OED. 

From  the  above  theorem,  we  can  obtain  another 
decomposition  criterion  which  is  described  in  the 
following . 


9.2.10.2  Decomposition  Ct iter  ion ( 2) . 

If  for  a node  i £ R in  a cycle  of  an  induced  graph, 

0 = [1  ,u  ,c  ] [1  ,u  ,c  1 and  1 > 1 or  c >1,  then 
i xxxyyy  x x 

decompose  the  node  i (and  all  the  related  nodes)  into  the 

nodes  represented  by  the  following  set  of  elements: 

1 

D = A (1  + k.c)  I 0 <=  k < 1 -1 

1 x 

2 

D * A ( 1 + k.c.c  ) I 
i x 

1 -1  <*  k <*[(u  — ( 1 +c . ( 1 -l))/c  .cj 

X L XX 


— 1 


) 

> 

t 

i 

i 
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3 2 1 

and  D =0  - (D  + D ) 

i i i i 

whete- 

the  node  i is  represented  by  A(l,u,c). 


As  an  example,  consider  the  directed  araph  shown  in 
Figure  9.10.  For  this  qraoh, 

0 = (4, 0,2,0)  = [1,3,2] [4, 2, -2] 


Therefore,  usinq  the  criterion  (2),  the  node  2 can  he 
decomposed  into  nodes  that  represents  the  following  sets 
of  elements: 

1 

D * () 

1 

2 

D = (6,6+2)  = (6,9) 


3 

and  D = (6, 7,8,9)  - (6,8)  » (7,9) 
i 

The  decomposed  graph  is  shown  in  Figure  9.11. 


* 


I 

l 

( 

I 

l 


Figure  9.11.  A Decomposed  Graph  of  Figure  9.10. 
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We  will  now  say  that  an  induced  graph  is  indecomposable 
if  the  decomposition  criterions  (1)  and  (2)  do  not 
decompose  the  qtaph.  Therefore,  Theorem  9.3  can  be 
rephrased  as  follows: 

Theot em  9.4. 

If  an  induced  graph  is  indecomposable,  then  it  is 

sequenceable  iff  for  every  node  i £ R of  a cycle  in  the 

graph,  Q (q)  > q for  all  q for  which  0 (q)  > 0. 
i i 

9.2.11  The  ASPLITl  Procedure. 

The  ASPLITl  procedure  is  used  to  decompose  a node  of  an 
induced  qtaoh  using  the  decomposition  criterion  (1) 
described  in  section  9.2.10.  The  algorithm  A9.1  describes 
the  functions  of  the  routine  ASPLITl.  The  steps  1 thru  9 
in  the  algorithm,  obtain  the  first  (XI)  and  the  last  (X2) 
"computable"  elements  of  the  node,  and  the  first  (Yl)  and 
the  last  (Y2)  "non-comoutable"  elements  of  the  node. 

If  XI  < Yl  and  X2  < Yl,  then  all  the  computable 
elements  ate  the  leading  elements  of  the  node;  and  so,  the 


I 

\ 

) 

l 

l 

r 

r 


r 

■ 


i 


"computable"  elements  of  a node  i € R,  are  those  elements 
that  are  contained  in  those  02  predecessors  of  i which  are 
not  contained  in  any  of  the  cycles  through  i;  and 
"non-computable"  elements  are  those  elements  that  are  not 
computable. 
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decomposition  criterion  (1)  cannot  be  aDplied  (step  10). 

If  the  computable  elements  of  a node  are  the  trailinq 
elements  of  that  node,  then  the  increment  is  negated  (step 
11).  The  negation  is  also  performed  if  the  "distance" 
between  the  first  element  and  XI  is  greater  than  the 
distance  between  the  last  element  and  X2.  An  example  of 
the  latter  case  is  the  node  (3)  in  the  induced  graph  of 
Figure  9.6.  For  this  node: 

XI  = 3 = X2 , 

Y1  * 1,  and 
Y2  = 4. 

In  this  case,  the  increments  associated  with  the  nodes  (3) 
and  (4)  are  neqated,  so  that  their  reoresentations  ate 
changed  to  A(6,3,-l)  and  A(4,l,-1)  respectively. 

If  it  is  possible  to  decompose  a node,  then  one  of  the 
three  techniques  for  decomposition  described  in  steos 
(12),  (16)  and  (19)  is  used. 


ALGORITHM  A9.1:  ASPLITl(I). 

Function:  Decomposes  the  node  I of  an  induced  graph  usinq 
the  decomposition  criterion  (1)  described  in 
section  9.6.1. 

Stepl:  Set  XI,  X2,  Yl,  Y2  » 0. 

Step2:  For  each  02  predecessor,  K,  of  node  I,  perform 
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steps  (3)  thtu  (9)  . 

Step3:  If  K is  contained  in  a cycle  through  I,  then  goto 

step  7. 

Step4:  Let  M = [1  ,u  ,c  ][1  ,u  ,c  |.  If  Xl  = 0 then  set 

ki  xxx  xxx 

XI*  MIN ( 1 , u );  otherwise,  set  XI  * MIN(X1,1  ,u  ). 

y y y y 

Step5:  If  X2  * 0 then  set  X2  = MAX(1  ,u  );  otherwise,  set 

V y 

X2  * MAX ( X2 , 1 ,u  ). 

y y 

Step6:  Goto  step  9. 

Step7:  If  Y1  * 0 then  set  Y1  * MIN(1  ,u  );  otherwise,  set 

y Y 

Y1  * MIN ( Yl , 1 ,u  ). 

y y 


Step8:  If  Y2  * 0 then  set  Y2  * MAX(1  ,u  );  otherwise,  set 

y y 

Y2  = MAX ( Y 2 , 1 ,u  ) . 

y y 

Step9:  Continue. 

SteplO:  If  XI  < Yl  and  X2  < Yl  then  no  decomposition,  so, 
return. 

stepll:  If  (X2  > Y2  & Xl  > Y2)  | (Y2  > X2  ft  Y2-X2  < Xl-Yl) 


Then  negate  the  increment  associated  with  this 
node  and  all  the  related  nodes.  At  end,  return. 
Stepl2:  If  Xl  * 1 then  split  the  node  I into  two  nodes 
that  represent  the  following  sets  of  elements: 


« J-L 


I 

i 

( 

I 

l 


I 

— ■ ■■■  I — ■ — - | 

) 

l 

I I 

• { 
t 
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1 

D = A ( 1 ),  A ( 1 +c  ),  ...,  A ( 1 +(Y1-2).C  ) 

1 i i i i i 

and 

2 

D = A ( 1 +(Yl-l).c  ),  A ( 1 +Y1.C  ), 
i i i i i 

A ( 1 + k.c  ) 

i i 

where  k =^(u  - 1 )/cJ. 

At  end,  return. 

Stepl3:  For  each  cycle  throuqh  I,  perform  steps  14  thru 
18. 

Stepl4:  Compute  0 for  the  cycle;  and  if  0 (XI)  > 0 

i i 

and  0 (XI)  > XI  + 1 then  do  steps  15  thru  17. 
i 

Stepl5 : Let  J * 0 (XI ) — XI . 

i 

Stepl6:  Decompose  the  node  I into  two  nodes  representinq 
the  following  sets  of  elements: 

1 

0 * A ( 1 +(Xl-l).c  ),  A ( 1 + (Xl+J-1) .c  ),  ..., 

i i i i i 


A ( 1 +(Xl+k.J-l) .c  ) 
i i 


where  k *l((u  -1  ) /c  ♦l-XD/J 
L i i i 

and 
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i 

2 1 
D * D - 0 . 

i i i 

2 

Decompose  D if  XI  jrf  J or  XI  * J & J > 2. 


At  end,  return. 

Steol7:  Continue. 

Stepl8:  Continue. 

Stepl9:  Decompose  the  node  I into  two  nodes  represented  by 
the  following  sets  of  elements: 

1 

D * A ( 1 ),  A ( 1 +c  ),  ...,  A ( 1 +(Xl-2).c  ) 

1 i i i i i 

and 

2 1 
D * D - D . 

i i i 

Step20:  Return. 


9.2.12  The  ASPLIT2  Procedure.  ! 

The  procedure  ASPLIT2  is  used  to  decompose  a node  of  an 
induced  graph  using  the  decomposition  criterion  (2) 
described  in  section  9.6.1.  The  algorithm  A9.2  describes 
the  functions  of  the  routine  ASPLIT2. 
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ALGORITHM:  A9.2:  ASPLIT2(I). 

Function:  Decomposes  the  node  I in  an  induced  graph  using 
the  decomposition  criterion  (2)  described  in 
section  9.5.1. 

Calls : 

Called  by: 

Stepl:  For  each  (32  predecessor,  K,  of  node  I,  perform 

SteDS  (2)  thru  (6) . 

Step2:  Let  M =11  ,u  ,c  111  ,u  ,c  ] 

ki  xxx  y y y 

Step3:  If  K is  not  contained  in  any  cycle  through  I,  then 

goto  step  6. 

Step4:  If  1 > 1 or  c > 1 then  do  step  5. 

x x 

Step5:  Decompose  the  node  I into  three  nodes  the  elements 

of  which  are  represented  by: 

1 

D * A (1  + k.c)  | 0 <=  k < 1 -1 

1 x 

2 

D * A ( 1 + k.c.c  ) I 
i x 

1 -1  <*  k <*| (u  - ( 1+c. (1  -l))/c  .c  J 

X XX 

3 2 1 

and  D « D - (D  +0  ) 

i i i i 

where - 

the  node  i is  represented  by  A(l,u,c). 
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At  end,  return. 
Steo6:  Continue. 

Step7:  Return. 


9.2.13  Comments  on  the  Decomposition  of  Array  Graphs. 

The  decomposition  algorithms  described  in  earlier 
sections  may  not  be  too  efficient  for  general  array 
graphs.  Some  additional  refinements  can  be  aDDlied  to 
both  of  them.  One  refinement  to  the  decomposition 
criterion  (1)  is  given  below. 

Consider  two  cycles  thru  a node  i Q,  R in  an  array 
graph.  If  one  of  the  cycles  contains  only  one  02  edge, 
and  the  other  more  than  one,  then  the  decomposition 
criterion (1)  can  be  applied  to  the  sub-qtaph  of  the  array 
qiaph  which  consists  of  only  the  former  cycle.  For 
example,  consider  the  directed  qraDh  shown  in  Figure  9.12. 
In  this  fiqure,  the  cycle  1-2  is  inconsistent,  as  it  does 
not  satisfy  the  condition  in  theorem  9.2.  If  we  apply  the 
above  refinement  for  the  decomoosition  cr iter  ion (1) , the 
graph  in  Figure  9.12  will  be  changed  to  the  qraph  shown  in 
Figure  9.13.  The  cycle  1-2  in  Figure  9.13  is  now 
consistent. 


I 


A(3,  10,  1) 
(1) 


Figure  9.12. 


Figure  9. 13. 
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9.2.14  S equenceability  and  Schedulability . 

So  far,  we  have  discussed  the  condition  for 
sequenceability.  Now  we  will  try  to  obtain  an  alaorithm 
for  schedulnq  a sequenceable  qraph.  We  will  call  an 
induced  qtanh  schedulable  if  we  can  obtain  a schedule  for 
that  qraph.  It  can  be  easily  seen  that  not  al 
sequenceable  qtaphs  are  schedulable.  However,  we  will 
obtain  a sufficient  condition  for  schedulability , and  try 
to  find  a decomposition  criterion  to  decompose  a 
sequenceable  qtaDh  into  a schedulable  araoh. 

To  illustrate  that  not  all  sequenceable  qtaohs  are 
schedulable,  consider  the  followino  array  qtaDh: 

Al:  A ( 1 ) =0; 

A2 : A ( 2. I ) * A ( I ) + 1 FOR  1 <»  I <«  50 

A3:  A ( 2 . 1-1 ) » A ( 2. I ) + 2 FOR  31  <«  I <«  50 

A4 : A (2. 1+1 ) * A (2. 1+5)  +4  FOR  1 <=  I <«  29 

The  cot tespondinq  induced  qraph  is  shown  in  Fiqute  9.14. 
If  we  apply  the  modified  decomposition  criteLion(l)  (see 
section  9.2.12),  the  induced  qraph  shown  in  Fiqure  9.14 

will  be  chanqed  to  the  qraph  shown  in  Fiqure  9.15.  For 

this  qraph,  the  first  decomposition  criterion  can  no 
lonqet  be  aoolied.  However,  second  decomposition  can  be 
applied  because: 


( 

I 

l 

r 


► 

•i 

I 


9.14.  Induced  Graph  for  the  Assertions: 

A(  1)  - 0} 

A(2. 1)  - A(I)  + 1;  For  14  r $ 50 

A(2. 1-1)  - A(2.I)+2;  For  31$  E « 50 

A(2. 1+1)  - A(2. 1+5)+4;  For  1$  1$  29 
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A(l,  1,1) 

(1) 


2 


A Transformation  of  Graph  in  Figure  9.14. 
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Q = f 15, 15, 11 [2,2,11 

6 

Therefore,  after  applying  the  second  decomposition 
criterion,  we  get  the  giaoh  shown  in  Fiquie  9.16.  All  the 
cycles  in  the  graph  of  Fiquie  9,16  ate  consistent,  and 
thetefoie,  the  qiaph  is  sequenceable  accordinq  to  theorem 
9,3.  However,  there  is  no  easy  way  to  obtain  a schedule 
for  even  the  graoh  in  Fiaure  9.16,  Observation  of  the 
graph  shows  that  a possible  sequence  of  computing  the 
elements  of  the  array  is  given  by  the  following  sequence 
of  indices: 


1,2,4,8,16,32,64, 

using 

* 

CM 

< 

63 

using 

A3, 

59,55,51,.. .,31, ...7, 3, 

usinq 

A4 , 

6,14,22,... ,62,.. .,94, 

usinq 

A2, 

a 

61 

using 

A3, 

- 

57,53,49,  ...  5 

using 

A4 , 

and  so  on. 

Therefore,  the  graph  has  to  be  decomposed  further  so  as  to 
schedule  it.  A decomposition  criterion  to  decompose 
sequenceable  qraphs  into  schedulable  graphs  is  derived  in 
the  following  sections. 


Figure  9.16.  A Decomposition  of  Graph  in  Figure  9.15 
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9.2.15  A Sufficient  Condition  for  Schedulabi 1 ity . 

— * " ' **  1 — — 1 1 " 1 " - ■ 

It  can  be  seen  that,  if  the  sum  of  thresholds 

r 

associated  with  all  02  edqes  of  a cycles  is  > 0 then  the 

recursion  represented  by  that  cycle  can  be  scheduled. 

(This  statement  is  analoqous  to  the  statement  that  in  a 
serial  electric  circuit  consistinq  of  a diode  between  two 
nodes  A and  B,  there  will  be  conduction  if  the  sum  of 
voltaqes  between  nodes  A and  B is  Positive).  This  is 
because  of  the  followinq:  There  exists  at  least  one  node 
in  the  cycle,  such  that,  if  we  start  with  that  node  and 
traverse  the  cycle,  and  sum  all  the  thresholds  on  the 
encountered  02  edqes,  the  sum  is  never  neqative.  This 
means  that,  if  we  start  to  qenerate  the  schedule  startinq 
with  that  node,  we  can  obtain  thejchedule  for  that  cycle. 

Therefore,  we  qet  the  followinq  condition  for 
schedulability : 

An  induced  qtaoh  is  schedulable,  if 
the  sum  of  thresholds  associated  with 
all  the  02  edqes  is  positive  for  every 
cycle  in  the  graph. 

Algorithm  A9.3  describes  the  steps  involved  in 

1 

generating  a schedule  for  a sequenceable  induced  qraoh. 

I i 

This  algorithm  will  be  illustrated  by  going  thru  the 


nf  of  ane  nananf  ^ nn  f ha  eaha/^nl  a far  f Ua 
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qtaph  in  Figure  9.7.  Foi  this  example,  START  * 1 and  END  t 

* 1;  and  in  step  1,  3 cycles  ate  obtained.  These  ate: 

(1,2, 3, 4),  (1,2, 5, 6)  and  (1,2, 7, 8).  The  ranqe  (ls,us,cs) 

r 

is  (1,8,1)  in  step  3.  Dutinq  the  analysis  of  the  cycle 

(1,2, 3, 4),  in  steos  (5)  thru  (13),  the  followinq  schedule 

elements  ate  generated: 

DO  Is  * 1 TO  8; 

8 ( Is ) ; 

END; 

IF  Is  <=  4 THEN  DO  13  « Is; 

A ( 1 3+1 ) ; 

END; 

Dutinq  the  analysis  of  the  cycle  (1,2, 5, 6),  the  followinq 

schedule  elements  are  generated: 

IF  Is  >»  2 S Is  <«  4 s MOD ( Is , 2)  » 0 THEN  DO  15  =»  Is; 

A ( 15+4)  ; 

END; 

And,  dutinq  the  analysis  of  the  cycle  (1,2, 7, 8),  the 

followinq  schedule  elements  ate  generated:  t 

IF  Is  >*  3 & Is  <*  6 THEN  DO  17  • Is; 

A ( 17+4 ) ; 

END;  j L 

And,  finally,  an  end-element  is  generated  in  step  14. 




Algorithm  A9.3.  SCHEDUL  (START, END); 

Function:  Generates  the  schedule  for  a set  of  nodes  that 
ate  stronqly  connected.  START  and  END  are  the 


indices  of  the  order  vector,  ORDERV,  representing 


Stepl : 

Step2: 

Step3: 

Step4 : 

Step5: 

Step6: 

Step7 : 
Step8 : 
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the  beginning  and  end  of  the  set  of  strongly 

connected  nodes. 

Find  a list  of  all  cycles  in  the  set  of  nodes 
represented  by  ORDERV (START)  thru  OROERV ( END) . If 
there  are  no  cycles,  then  return. 

Let  SEOUENCED  be  an  array  of  size  IDICT,  and 

initialize  all  the  elements  in  it  to  zeros. 

Let  the  range (Is , us ,cs ) cover  all  the  ranges  of 
the  subscript  Is  associated  with  every  cycle 

obtained  in  step  1.  Also,  generate  the  following 
do-element  for  the  START  node: 

DO  Is  * Is  TO  us  BY  cs; 

For  each  obtained  in  step  1,  perform  steps  (5) 

thru  (13). 

Let  NODES  be  a vector  that  represents  all  the  K 
nodes  in  the  cycle  under  consideration.  For  J * 1 
to  K perform  steps  (6)  thru  (12). 

Let  q = NODES ( J) . If  SEQUENCED (q)  > 0 then  qoto 

step  12. 

Set  SEOUENCED (q)  « 1. 

If  q is  contained  in  L,  then  output  the  compound 
node  a,  and  the  end-element.  At  end,  qoto  steD 
12. 


) 

\ 

I 

9 

l 

f 


n 


* 


Step9:  q contained  in  R.  Therefore,  find  the  boolean 


I 

I 

l 

f 

1 
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expression  p that  restricts  the  range  of  Is  to  the 

i 

range  (la,uq,cq)  which  is  the  ranqe  associated 
with  the  node  q. 

SteplO:  Obtain  the  mapping  function  f that  relates  Iq  and 
Is. 

Stepll:  Output  the  do-element: 

IF  p THEN  DO  Iq  * f (Is)  ; 

Stepl2:  Continue. 

Stepl3:  Continue. 

Stepl4:  Output  an  end-element,  and  then  return. 


9.2.15.1  Decomposition  Criterion  ( 3 ) . 

If  a cycle  in  an  induced  graoh  is  seauenceable , but  the 
sum  of  thresholds  associated  with  that  cycles  is  <*  0, 
then  the  qtaph  is  decomposed  using  the  following 
criterion:  > 

Pick  any  two  nodes  i and  i in  the  cycle  that  are 
connected  by  a 02  edge,  and  for  which  the  threshold  is 
given  by  -T  (where  T is  positive) , and  the  map  vector  is 
given  by: 

i 

1 - (1  ,u  ,c  1 (1  ,u  ,c  J 

ij  x x x y y y 

Now,  apply  one  of  the  followinq  decompositions: 


- 


I 
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(1).  If  | c I > 1 then  decompose  the  node  j (and  all  the 
v 

related  nodes)  into  two  nodes  k and  1,  the  elements 

of  which  are  qiven  by: 

D = 1 + 2.m.c  I 1 <=  m <=  n /2, 

k 1 1 

and  D = D - 0 

1 j k 

where-  (1  ,u  ,c  ) represents  the  set  of  elements 

i i 1 

represented  by  the  node  i and  n is  qiven  by: 


n 

= 

I ( u - 1 )/  c J 

*-  i i i J 

(2).  If  | c 1=1  then  decompose  node  i (and  all  the 

y 

related  nodes)  into  the  nodes  represented  by  the 

following  two  sets  of  elements: 

D = A ( 1 + m.c  ) I 0 < = m < = T 
k i i 

D = D - D 
1 i k 

whet e- 

A ( 1 ,u  ,c  ) represents  the  node  i and  D represents 
i i i i 

the  set  of  elements  represented  by  the  node  i. 

In  the  following,  the  above  decomposition  is  applied 

to  the  graph  shown  in  Fiqure  9.16.  Figure  9.17  shows  the 

resulting  graph  after  aoplying  the  the  first  Dart  of  the 

decomposition  criterion  described  above  to  the  node  2. 
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l 

1 
f 

Fiqure  9.18  shows  the  resulting  graoh  after  applvinq, 
aqain,  the  first  oart  of  the  above  decomposition  criterion 
to  the  nodes  21  and  4 of  Fiqure  9.17.  The  decomposition 
criterion  can  be  applied  further  and  further  to  the  qraoh 
of  Fiqure  9.18  to  obtain  a oraoh  in  which  the  sum  of 
thresholds  on  all  02  edqes  is  positive,  and  then,  qet  a 
schedule  for  the  qraoh  usinq  the  Algorithm  A9.3. 

9.2.16  Variables  of  Dimension  Greater  than  1. 

We  will  now  try  to  extend  the  results  on  array  qtaph 
described  in  the  earlier  sections  to  the  qeneral  case 
where  the  nodes  represent  variables  of  dimension  > 1. 

Again,  we  will  use  the  notations 

111  222  ddd 

A ( 1 , u , c ) (1  , u , c )...  (1  , u , c ) 
iiiiii  iii 

to  represent  an  array  of  dimension  d.  The  subscript  (i  in 

' * F 

the  above  example),  associated  with  the  bounds  1,  u,  and  c 

will  be  used  as  before  to  denote  the  node  number.  We  will 

start  with  some  additional  definitions. 


Figure  9.17.  A Dec anpositicn  of  Graph  in  Figure  9.16. 


(33) 


Figure  9.18.  A Dec  imposition  of  Graph  in  Figure  9.17 
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9.2.16.1  Older  Vector . 

The  order  vector  O associated  with  two  nodes  i and  -j 

i j 

in  a cycle  of  an  induced  qraoh  is  defined  as  follows: 

O = (1,2,  ...,  d)  where  d is  the  dimension 
ii 

of  the  variable  represented  by 
the  node  i, 

O =0  if  j is  a 02  oredecessor  of  node  k, 
ik  ii 

0 = f (0  ) if  j is  a 01  predecessor  of  node  k,  and 

ik  i j 

and  i beinq  in  the  oath(i,k). 

The  function  f used  in  the  above  definition  maps  the 

elements  of  0 to  0 such  that  if  0 (d)  * 0 (q) , 

ij  ik  ik  ii 

then  the  subscript  names  used  for  the  p th  dimension 

of  node  k is  the  same  as  the  subscriot  name  used  in  the 

q th  dimension  of  node  j.  Thus,  an  order  vector  0 

ii 

represents  the  permutation  of  the  subscript  names 

used  in  node  i from  the  subscript  names  used  in  node  j. 

As  an  example,  consider  the  array  qraoh: 

A ( J , I ) * A ( I , J ) For  10104  & 1 O J O 10 

A (J-l  ,K)  * A ( K , J ) For  40  J O 9 & 4 <»  K O 10 

The  correspondinq  induced  qraph  is  shown  in  Fiqure  9.19. 

The  subscript  names  used  in  the  first  and  second 

dimensions  of  the  node  1 are  the  same  as  the  subscript 


I 
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names  user]  in  the  second  and  the  first  dimensions  of 

node  2.  Therefore,  we  have  0 * (2,1).  Similarly, 

12 

0 = (2,1).  And,  bv  definition,  0 = (1,2). 

34  34 

9.2.16.2  Scale  Factor . 

Scale  Factor  F associated  with  the  i th  node  of  a 

1 

directed  qraph  is  a set  of  d elements,  where  d is  the 
dimension  of  the  variable  represented  by  the  i th  node 
and  is  defined  recursively  as  follows: 

f (d)  - 1 
i 

FOR  k = d TO  2 BY  -1 

k k k 

f (k-1)  = f <k)*(l+(u  -1  >/c  ) 
i i i i i 

For  example,  for  the  directed  qrabh  shown  in  figure  8.6, 

F » (1,1)  - F - F 

1 4 5 

F - (3,1)  ■ F 

2 3 


A(  1,  4,  1)  ( 1,  10,1) 
(1) 


Figure  9.19. 


Induced 

Graph  for 

the  Assertions: 

A( J, I ) - 

A( I , J) 

FOR 

4 

fc 

1* 

J# 

10 

A(J-l.K) 

- A(K.J) 

FOR 

4* 

J4 

9 

& 

4« 

K » 

10 

y, 


: l 
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9.2.16.3  Weight  Vector . 

The  Weight  Vector  W associated  with  a 02  edge 

ii 

between  two  nodes  i and  i is  defined  as  follows: 

W = F (0  ) 

ii  i ii 

whete- 

0 is  the  order  vector  associated  with  the  02  edge, 

i j 

and  F is  the  Scale  Factor  associated  with  the  node  j. 
1 

Foi  example,  for  the  induced  graph  shown  in  Figure  8.6, 

W * F * (3,1) 

32  2 

and  W » F = (3,1) 

52  2 


9.2.16.4  Compound  Threshold. 

We  have  seen  in  section  9.2.15,  that  the  threshold 
associated  with  a 02  edge  of  a cycle  is  the  most  useful 
tool  in  obtaining  a condition  for  schedulabil ity  of  the 
induced  graph.  The  concept  of  threshold  can  be  extended 
to  the  general  case.  We  will  call  the  corresponding 
threshold  associated  with  the  02  edge  of  a general  array 
graph  as  Compound  Threshold  (CT) , and  will  define  it  as 
follows: 


I 

I 

i 

I 

• l 

l 
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112  2 d d 

CT  = W . T + W . T + . . . + W . T 
ij  ii  ii  ii  ij  ij  ii 


k 

where,  T is  the  threshold  associated  with  the  k th 

1 j 

dimension,  and  which  is  defined  the  same  way  as  in  section 
1 d 

9.2.5;  and  W , ...  , W ate  the  comoonents  of  the 

ij  ii 

weiqht  vector  associated  with  the  02  edye,  and  which  is 
defined  in  the  otevious  sub-section.  For  example,  for  the 
induced  qraph  of  Fiaure  8.6, 

1 

T =0, 

32 

2 

T =1, 

32 

and  therefore, 

CT  *0+1*1. 

32 

Similarly, 

CT  *1*3+0*  3. 

52 

• ' i 

As  in  the  simole  case,  the  compound  threshold  also 

defines  the  distance  between  the  closest  elements  which 

■ 

should  be  related  by  precedence. 
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9.2.16.5  Condition  for  Schedulability. 

Again,  we  will  extend  the  condition  for  schedulability , 

| 

which  was  derived  in  section  9.2.15,  to  the  qenetal  case. 
That  is. 

An  induced  qraph  is  schedulable,  if 
the  sum  of  compound  thresholds 
associated  with  all  the  02  edqes  is 
positive  for  every  cvcle  in  the  qraph. 

We  will  not  be  describing  the  algorithm  for  schedulinq  a 
general  array  ataph  in  this  dissertation. 


■ 


■fc.  • i • 


■ ■ ‘ - 
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9.3  Consistency  Analysis. 

This  section  describes  the  consistency  analysis 
peL formed  in  the  MODEL  Processor.  As  overviewed  in  Fiqure 
9.3,  consistency  analysis  consists  essentially  of  the 
followinq  4 parts: 

(1) .  Checkinq  for  multiple  definition  any  data  name  or 

any  element  of  a data  name  (Procedure  RICONS8). 

(2) .  Removinq  all  the  tyoe-8*  edqes  in  the  qraoh  that 

cause  cycle  (Procedure  REMOVE8). 

(3) .  Checkinq  the  consistency  of  each  recursion  usina  the 

results  described  in  the  previous  section  (Procedure 


I 


i 

F 

r 


CHKSCON) . 

(4).  Propaqatinq  the  subscripts  from  the  input  nodes  to 
the  output  nodes  in  the  qraoh,  at  the  same  time 
checkinq  for  consistencv  of  subscripts  specified  in 
the  LHS  and  RHS  of  an  assertion. 

The  Procedures  that  oetfoim  the  above  functions  are 
described  in  the  followinq. 

9.3.1  Implementat ion  Limitations. 

Not  all  the  results  derived  in  section  9.2  ate  used  in 
the  implementation  of  the  current  version  of  the  MODEL 

* type-8  edqes  are  the  same  as  the  02  edoes  described  in 
section  9.2. 


or ocessot 


The  items  that  aLe  not  included  in  the  system 


are : 

(1) .  Algorithms  for  splitting  the  assertions.  The 

procedures  ASPLIT1  and  ASPLIT2  described  in  section 
9.2.12  are  not  implemented.  The  algorithm  for 
splitting  the  assertion  usino  the  decomposition 

criterion  (3)  is  also  not  implemented.  Therefore,  if 
the  condition  given  in  section  9.2.15  is  not 

satisfied,  the  graoh  is  repotted  as  not  schedulable. 

(2) .  The  algorithm  SCHEDUL  described  in  the  previous 

section  is  not  implemented  as  it  is  part  of  the 
seguencing  Phase  (see  Chapter  11). 

(3) .  If  recursion  is  specified  on  mote  than  one  dimension 

of  a variable,  then  the  consistency  check  is  not 
oer formed. 

In  addition  to  the  above  limitations,  the  consistency 
check  is  not  performed  when  any  of  the  lower  bound,  upper 
bound  or  increment  associated  with  a node  of  the  graph  is 
not  an  integer. 


9.3.2  The 

Procedure 

R_ICQNS8  (Report 

Inconsistencies)  . 

This  procedure 

checks  to  see  if 

any  data  name  or 

any 

element  of 

a data 

name  is  defined 

more  than  once. 

The 

functions 

of  this 

procedure  can 

be  described  by 

the 
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followinq  steos.  i 

(1) .  Usinq  the  toutines  q#TVAPS  and  GTVARPS,  obtain  a 

list  of  all  taroet  variables  foi  a data  name. 

r 

(2) .  Usinq  the  routine  mapv,  find  the  intersection 

between  the  set  of  elements  represented  by  any  two 
nodes  in  the  list.  If  the  intersection  is 
non-emoty,  then  some  (or  all)  elements  of  the  two 
nodes  are  multioly  defined,  so  a warninq  messaqe  is 
sent  to  the  user  usinq  the  toutine  RICOMSE. 

9.3.3  The  Procedure  REMOVES  (Remove  Tyoe-8  Edqes). 


Dutinq  the  creation  of  the  Precedence  Matrix,  some 
tyoe-8  relationships  are  established  between  different 
nodes  of  a data  name.  These  tyoe-8  edges  can  make  the 
resultinq  gtaoh  cyclic  if  the  elements  of  the  data  name  is 
defined  recursively.  The  sorting  aloorithms  that  are  used 
later  on  reouite  the  qraoh  to  be  acyclic,  therefore,  those 
type-8  edqes  that  cause  cycle  must  be  removed.  For 
example,  both  the  directed  qtaohs  in  Figures  R.5  and  9.6 
ate  cyclic.  The  function  of  the  REMOVES  otocedurc  is  to 
remove  the  tyoe-3  edges  that  cause  cvcles.  Thus,  the 
graphs  in  Figures  9.5  and  9.6  are  chanoed  to  the  graphs 
shown  in  Figures  R.20  and  9.21  respectively. 


C(1,1, 1) 
(1) 


A(l,  1,1) 

(2) 


*2 


A(  1,  3, 1) 
(3) 


01 


A(2, 4,  1) 
(4) 


Figure  9.20.  Resulting  Graph  after  applying 

the  REMOVE8  Algorithm  to  the  Induced 
Graph  of  Figure  9.5. 
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Figure  9.21. 


Resulting  Graph  after  applying  the 
REM0VE8  Algorithm  to  the  Induced 
Graph  of  Figure  9.6. 
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Alqorithm  A9.4  describes  the  functions  of  the  procedure  i 

REMOVES.  It  takes  the  followinq  vectors  as  input  (where 
# DICT  indicates  the  number  of  dictionary  entries) : 

(1) .  4PRED  - a vector  of  size  #DICT,  and  the  element 

«PRED(I)  represents  the  number  of  oredecessors  of 
node  I. 

(2) .  # PRED8  - a vecotr  of  size  #DICT,  and  the  element 

I PRED8 ( I ) represents  the  number  of  tyoe-8 

predecessors  of  node  I. 

(3) .  ISUCC8  - a vector  of  size  fDICT,  and  the  element 

faUCCR(I)  represents  the  number  of  type-8  successors 
of  node  I. 

(4) .  PREDL  - a vectoi  of  size  n,  where  n = SUM(*PRED), 

and  such  that  PREDL ( KPREDF ( I ) ) TO  PREDL ( f PREDT ( I ) ) 
defines  all  the  predecessors  of  node  I. 

(5) .  PRED8L  - a vector  of  size  n,  where  n * SUM(|PRED8), 

and  such  that  PRED3L ( #PRED8F ( I ) ) TO 

PRED8L(#PRED8T(I) ) defines  all  the  tyoe-8 
predecessors  of  node  I. 

(6) .  SUCC8L  - a vector  of  size  n,  where  n * SUM(fStJCC8), 

and  such  that  SUCC8L ( ISUCC9F ( I ) ) TO 

SUCC8L ( 4SUCC8T ( I ) ) defines  all  the  type-P  successors 
of  node  I. 


(7).  #PREDF , # PREDT,  fPREDBF,  IPRED8T,  4SUCC8F , ISUCC8L, 
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all  of  size  IDICT,  and  define  the  predecessors, 
type-8  oredecessors  and  tyoe-8  successois  as 
mentioned  in  (4),  (5)  and  (8)  above. 

All  the  above  vectors  are  computed  in  the  procedure 
ORT4P,  which  is  described  in  section  °.l,2. 

As  output,  it  generates  the  or der-vectoi  , ORDEPV,  which 
is  of  size  SDICT  and  contains  the  ordered  nodes. 

Steps  1 thru  10  in  the  alaorithm  obtain  the  order 
vector  using  conventional  toooloqical  sorting  technique. 
Steos  12  thru  14  removes  the  tyoe-8  edges  that  cause  the 
cycle.  If  no  tyoe-8  edge  is  removed  in  these  steos,  then 
there  must  be  some  regular  cycles  in  the  graoh  (due  to 
simultaneous  equations,  and  this  will  be  detected  in  the 
procedure  CYCLES,  described  later  on).  In  this  case,  a 
node  that  has  least  number  of  predecessors  is  assumed  to 
be  the  next  candidate  for  ordering  (Steo  15). 


Algorithm  A9.4:  REMOVES. 

Function:  Removes  the  tvpe-8  edges  that  cause  cvcles. 
Called  by:  RICONS. 

Calls:  REMOVF8,  ADJJM. 

Inputs:  fPREO,  IPRED8,  ISUCC8,  PRFD1 , PRF.D8L,  SUCC8L, 

# PREDF , # PPEDT , DPRED8F,  # PRE08T , #SUCC8F, 
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Outputs 
Stepl : 
Step2: 
Step3: 
Step4 : 
Step5: 

Step6 : 

Step7 : 
Step8: 

Steo9 : 
SteolO : 
Steplls 

Stepl2: 

Stepl3: 


Steol4  s 


ISUCC8T. 

ORDERV. 

Set  fORDERED  * 0 and  T# PRED8  * I PRED8 . 

Set  T#OROERED  = IORDERED. 

For  I = 1 to  IDICT  perform  steps  4 thru  10. 

If  # PRED ( I ) > 0 then  qoto  sten  10. 

Do  the  followinq:  IORDERED  * IORDERED  + 1;  and 

ORDERV ( #ORDERRD)  * I; 

For  each  successor  N of  node  I perform  steps  7 
thru  9. 

Set  IPRED(N)  * IPRED (N)  - 1» 

If  ADJM  ( I ,N)  - 8 then  set  T«PRED8(*J) 

T#  PRED8 ( N) -1 ; 

Continue. 

Continue. 

If  IORDERED  ■ #OICT  then  return;  otherwise,  if 
TIORDERED  < IORDERED , then  qoto  step  2,  otherwise, 
continue  to  step  12. 

For  I * 1 to  #DICT  perform  steos  13  thru  14. 

If  # PRED ( I ) ■ T#  PRED8 ( I ) and  TfPREDR(I)  < 

# PRED8 ( I ) then  remove  every  type-8  predecessor  of 
node  I usinq  the  routine  REMOVER  and  then  qoto 
step  2. 

Continue. 


: 


i 

i 


-V, 
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SteDl5:  Find  a source  variable  node  that  has  least  number 
of  predecessors,  and  for  that  node  set  *PRED(I)  = 
0,  and  then  goto  steo  2. 


9.3.4  The  Pt ocedur  e CHKSCON  (Check  Subscript  Consistency). 


This  procedure  checks  for  the  schedulahi l ity  of  the 
array  graphs  using  the  results  obtained  in  section  9.2. 
The  functions  of  the  procedure  are  described  in  the 
following  steDs: 


Stepl : 


Steo2 : 

Step3 : 
Steo4 : 
Step5: 


Step6: 


Using  the  Procedure  GCYCLES,  net  all  the  cycles  in 
the  directed  qraoh.  Let  N be  the  number  of  cycles 
obtained. 

Let  CONSISTENT  be  an  array  of  N elements. 
Initialize  all  its  elements  to  zeros. 

For  I * 1 to  N Perform  steos  4 thru  8. 

If  CONSISTENT ( I ) > 0 then  aoto  step  R. 

Usinq  the  routine  CFHRES,  find  the  sum  of 
thresholds  associated  with  all  the  tyoe-8  edaes  of 
the  I th  cycle  obtained  in  steo  1.  If  the  sum  is 
> 0 then  set  CONSISTENT ( I ) * 1 and  qoto  steo  8. 
Usinq  the  routine  CSMAPV,  find  the  self  mao  vector 
associated  with  every  node  of  the  1 th  cycle.  If 
the  condition  in  Theorem  9.1  is  satisfied  for  this 


I 


I 
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cycle,  then  set  CONSISTENT ( I ) to  2,  and  goto  step 

8. 

Steo7:  Using  the  Procedure  CLINC,  chanqe  the  increment 

associated  with  a node  (in  which  the  computable 
elements  ate  on  riqht),  and  then  apDly  the 
consistency  criterions  in  steps  (5)  and  (6)  aqain. 
Step8:  Continue. 

Step9:  For  I * 1 to  N perform  steps  10  thru  13. 

SteolO:  If  CONSISTENT ( I ) = 1 then  qoto  steo  13. 

Stepll:  If  CONSISTENT ( I ) * 2 then  indicate  that  the  I th 

cycle  is  sequenceable,  but  not  schedulable. 

Stepl2:  If  CONSISTENT ( I ) <*>0  then  indicate  the  error  that 
the  I th  cycle  is  not  seouenceable. 

Stepl3:  Continue. 

9.3.5  The  Procedure  SUE PROP  (Subscript  Propagation) . 

The  process  of  subscript  oropaqation  refers  to  the  task 
of  propaqatinq  subscript  names  and  the  ranges  associated 
with  the  source  variables  of  an  assertion  to  the  tarqet 
variable  of  that  assertion.  It  also  checks  for  the 
consistency  between  the  propagated  subscript  names  (and 
ranges) , and  the  specified  subscript  names  (and  ranges). 
The  process  is  summarized  in  the  following  steps: 

(1).  Associate  a subscript  set  with  each  dictionary  entry 


' ^ ^ 
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that  represents  a field,  interim  or  subscript  name. 

This  set  contains  a set  of  subscript  names  on  which 
the  entry  depends.  For  example,  for  the  dictionary 
entry  that  represents  the  subscripted  data  name: 

A ( 1 , 1 , J ) , the  set  is  (1,7). 

(2) .  Obtain  an  order  vector,  ORDERV,  that  represents  the 

topoloqical  sortina  of  the  dictionary  entries.  And 
for  each  assertion  node  in  ORDERV  perform  the 
followinn  steps  (ORDERV  is  needed  in  order  to 

t 

propaqate  subscript  lists  from  the  input  nodes  to 
the  output  nodes.  Such  prooaqation  (from  first 
entry  to  the  last  entry  in  ORDERV)  allows  us  to 
resolve  some  inconsistencies  in  the  subscript 
specification  of  some  interim  data  names.): 

(3) .  Find  the  set  SR  which  is  a union  of  the  subscript 

names  associated  with  all  the  source  variables  of 
the  assertion.  (If  any  of  the  source  variables  is 
used  as  an  argument  of  a reduction  function  like, 

SUM,  then  the  set  associated  with  that  name  must  be 
modified  accodinoly.)  For  example,  if  the  RHS  of  an 
assertion  is  qiven  by: 

A(I)+B(J,I)+SUM(C(I,K) ) 

then  SR  * ( I , J ) , assuminq  that  the  SUM  function  is 
applied  to  the  last  dimension  of  C.  Note  that  the 
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order  of  subsctipt  names  is  not  maintained. 

(4).  Now  compare  the  set  SR  with  the  set  of  subscript 
names  SL  associated  with  the  tarqet  variable  of  the 
assertion.  The  subscriot  soecif ication  is 
consistent  if  the  two  sets  satisfy  one  of  the 
following  criteria: 


- 

(a) . 

SL 

= 

SR,  or 

(b) . 

SL 

> 

SR,  or 

• 

(c) . 

SR 

< 

SL  - if  the  variable  on  the  LHS  is  an 

interim  data  name. 

The  second  case  is  an  examole  of  an  expansion 
function  like: 

X(I,J)  = Y(J) ; 
or  like: 

X(I)  = 0; 

In  the  third  case,  the  data  description  statement  of 
the  interim  data  name  on  the  LHS  is  modified  to  take 
care  of  the  inconsistency. 

(5).  If  the  sets  SL  and  SR  do  not  satisfy  one  of  the 
criterions  mentioned  in  step  (4),  then  an  attempt  is 
made  to  identify  the  subscript  names  in  the  set 
SL  - {SL  ft  SR}  with  the  subscript  names  in  the  set 
SR  - {SL  ft  3R } . If  any  such  identification  does  not 
cause  conflicts  (like,  having  same  subscript  name 


I 
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for  two  different  source  data  names  or  two  different  i 

target  data  names,  as  described  in  the  procedure 
CSUBRED) , then  the  cor i esoondi nq  subscript  sets  are 

► 

chanqed,  and  the  consistency  check  in  step  4 is 
applied  aqain.  For  examole,  consider  the  followinq 
assertion: 

B(I)  - *(J) * 

For  this  assertion,  SL  * {I},  and  SR  = {]}. 

Therefore,  if  identifying  I with  J does  not  cause 
any  conflict,  the  assertion  is  chanqed  to: 

B(I)  = A ( I ) ; 

A conflict  can  occur  when  A and  R are  of  different 

1 

size. 

(6).  If  an  assertion  contains  more  than  one  tarqet 
variable,  then  the  subsctiot  set  SL  associated  with 
each  tarqet  variable  must  satisfy  one  of  the 

criterions  described  in  steo  (4). 

The  above  functions  are  performed  by  the  procedure 
SUBPROP.  This  procedure  calls  the  routines:  UNIONL, 

XNLIST,  SUBTRL,  UPDINTM , SUBPRC  AND  SUOPRU. 

The  routine  UNIONL  obtains  the  union  of  two  arquement 
subscript  lists.  This  is  used  to  obtain  the  union  of  all 
the  subscripts  associated  with  the  source  variables  of  an 

assertion.  The  routine  XNLIST  is  used  to  find  the  . 1 


I 
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intersection  of  two  argument  lists  and  the  routine  SUBTRL 
is  used  to  subtract  a list  from  another. 

The  routine  UPDINTM  is  used  to  uDdate  the  dimension  of 

4 

an  interim  variable  (see  also,  step  (4)  above),  and  to  ; 

change  the  assertions  that  use  it  (routine  UPDASSN  is  used 

for  this  purpose) . The  routine  SUBPRC  checks  to  see  if  j 

any  elements  in  the  two  arguments  are  equivalent.  If  two  ( 

elements  are  equivalent,  then  one  of  the  elements  is 
deleted  from  the  list.  j 

The  routine  SUBPRU  updates  the  data  names  pointed  by 

« 

POINTER  varaibles.  For  example,  if  the  following 

\ 

assertion  was  used  in  a specification: 

POINTER. MAST  « <expression> ; 

and  the  variable  POINTER. MAST  was  reconnized  as  an  array 
of  dimension  n,  then,  n dimensions  are  added  to  the  data  * 

name  MAST.  Also,  the  dimensions  of  all  the  successors  of 
MAST,  and  their  usage,  are  aoproor iately  updated. 

9.3.6  Cycles  Analysis  (Procedure  CYCLES) . 

The  user  specification  of  a problem  may  be  inconsistent 
in  that  a data  name  may  be  a predecessor  of  itself.  In 
this  case,  there  will  be  cycles  in  the  directed  qraph. 

These  cycles  are  detected  usino  conventional  techniques  in 
the  Procedure  CYCLES.  The  functions  of  the  procedure 
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CYCLES  ate: 

(1) .  To  find  all  the  strongly  connected  cycles  in  the 

graph . 

(2) .  Check  if  there  ate  simultaneous  eauations  in  the 

specification  (among  the  strongly  connected  cycles 
obtained  in  steo  (1)).  >\nd,  if  there  are,  they  are 

grouped  toqether  in  the  Associative  Memory  for  later 
generation  of  a solution  procedure. 

(3) .  If  the  strongly  connected  cycles  in  the  graoh  do  not 

form  a set  of  simultaneous  eauations,  then,  an  enor 
message  is  sent  to  the  user  requesting  him  to  open 
the  loops  detected  in  the  soecif ication. 

The  detailed  description  of  the  CYCLES  algorithm  can 
be  found  in  the  dissertation  by  Cana  (OAN78). 

9.4  Some  Examples. 

In  this  section,  two  examples  are  given.  These 
examples  illustrate  some  of  the  functions  Performed  by  the 
MODEL  Processor  during  the  consistency  and  completeness 
analysis. 

The  first  examDle  is  the  MODEL  specification  shown  in 
Figure  9.22.  In  this  specification,  some  of  the  elements 
of  array  A ate  multiply  defined  and  some  of  them  are 
undefined.  Therefore,  as  described  earlier,  the  procedure 
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RESIC8  generates  the  assertions  $A2001  and  SA2002  shown  in 
the  bottom  of  the  Fiqure  9.22.  The  warning  report  for 
this  example,  which  is  shown  in  Figure  9.23,  indicates  all 
the  multiply  defined  elements  of  the  array. 

The  second  example  is  the  MODEL  specification  in  Fiqure 
9.24.  The  induced  qraph  for  this  example  is  shown  in 
Fiqure  9.10.  As  mentioned  earlier,  the  cycle  in  this 
graph  is  inconsistent,  and  because  we  do  not  split  the 
assertions,  the  error  messaqe  shown  in  Fiqure  9.25  is 
reported  to  the  user. 


_ 


sounce  listing  and  generated  statenents 


«MNl'ISS 


SOURCE  LISTING  »N0  GENERATED  STATEMENTS 


CHAPTER  10 


I 

t 


DOCUMENTATION 

This  chapter  describes  the  fourth  ohase  of  the  MODEL 
Processor,  namely,  the  Documentation  Phase.  The  main 
function  of  this  ohase  is  to  generate  the  recruited  uset 
Lepoits.  Some  of  the  tepoLts  3te  generated  throughout  the 
first  4 ohases.  Therefore,  all  the  documentation  produced 
by  the  first  four  ohases  of  the  Processor  will  be 
described  (includin'!  the  different  error  reports)  in  this 
chapter . 

Basically,  the  MODEL  Processor  aenerates  4 user 
reports.  They  are: 

(1) .  Source  Listing  (SAPOUT) , 

(2) .  Cross  Reference  Reoort  (XREF) , 

(3) .  Formatted  MODEL  Soec i f ic3t ion  Listing 

(MODE LOUT) , 

and  (4).  Precedence  Matrix  Report  (SYSPPINT) . 

The  MODEL  Processor  also  generates  3 kinds  of  error 
reports.  They  ate: 

(1) .  Syntax  error  report  (SAPEPR) , 

(2) .  Semantic  error  report  (ERROP) , 
and  (3).  Warninqs  (WARNING). 

The  names  inside  oarantheses  in  the  above  lists 
indicate  the  file  names  to  which  the  corresponding  reoort 
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is  outout.  , 

Fiqure  10.1  shows  the  time  at  which  each  of  the  above 
reports  ate  generated.  In  the  following,  the  qenetation 
of  different  reports  is  described  briefly.  The  examole  in 
Fiqure  4.5  is  used  to  illustrate  the  different  reoorts. 

10.1  Source  Listing. 

This  report  consists  of  the  input  MODEL  statements  and 
all  the  statements  generated  by  the  MODEL  Processor.  This 
report  is  generated  throughout  the  first  four  phases  of 
the  Processor.  Each  entry  in  the  report  is  identified  by 
the  statement  number  on  the  left.  Every  other  report 
refers  to  a statement  by  referring  to  this  statement 
number.  On  the  rtHS  of  each  statement,  the  user  supplied 
line  number  is  indicated,  if  no  line  number  was  specified 
in  the  user  input  statement,  then  this  represents  the 
sequence  number  of  that  statement.  If  the  statement  was 
generated  by  tb~  system,  this  entry  will  be  replaced  by 

| 

*********  • t 


The 

source 

listing 

for  the 

MODEL 

specification 

of 

Fiqure 

4.5  is 

given  in 

Figure 

10.2. 

This 

listinq 

is 

output 

to  the 

file  SAPOUT,  and 

the  procedure 

WSAPOUT 

is 

used  to 

output 

a line  to 

this  file. 
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10.2  The  Ci oss  Reference  Repot t. 

This  report  consists  of  an  entry  for  each  data  name  or 
an  assertion  name  used  in  the  MODEL  SDecif ication.  The 
different  entries  are  listed  in  alphabetical  order  by  the 
name.  With  each  entry,  the  followinq  information  is 
provided : 

(a) .  Statement  number  in  which  the  name  is  defined  (same 

as  the  statement  number  on  the  I.HS  of  each  entry  in 
Source  Listinq  described  in  section  10.1). 

(b) .  The  user  suoplied  line  number  associated  with  the 

name.  If  the  statement  for  the  name  was  qenerated 
by  the  system,  then  this  entry  will  be  replaced  by 
• «*****•  m 

(c) .  The  dictionary  index  for  the  name. 

(d) .  Attributes  of  the  name,  like,  size,  parent  data 

names,  tyoe  of  data  name,  and  so  on. 

(e) .  A list  of  statement  numbers  that  refer  this  data 

name. 

The  Cross  Reference  Report  for  the  RUN_1  example  in 
Fiqure  4.5  is  shown  in  Figure  10.3.  This  report  is  output 
to  the  file  XREF,  and  the  procedures  BUILOD  and  XREFPRT 
ate  used  to  generate  this  reoort.  The  Procedure  BUILDD 
builds  a dictionary  of  data  names  from  the  directory  of 
the  Associative  Memorv.  The  procedure  XREFPRT  generates 
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the  report  for  each  entiy  in  the  dictionary  created  bv  the 
procedure  BUILOD. 

10.3  The  Formatted  MODEL  Specification  Listing. 

This  tepoit  consists  of  the  foLmatted  listinq  of  the 
MODEL  specification.  It  consists  of  three  sections: 

(1) .  Header  Section, 

(2) .  Data  description  Section, 
and  (3).  Assertions  Section. 

The  statements  in  the  data  descriotions  section  are 
indented  appropriately  to  reflect  the  tree  structure  of 
the  files  and  interim  data  names.  The  assertions  section 
is  divided  into  two  parts.  The  first  oart  consists  of 
simple  assertions  and  the  second  oart  consists  of 
simultaneous  set  of  eouations,  if  anv. 

The  formatted  listinq  for  the  RUN_1  examole  of  Figure 
4.5  is  shown  in  Figure  10.4.  This  listinq  is  outout  to 
the  file  MODELOUT.  The  Procedures  PRTMFD,  PRTSTMT  and 
PRTASSX  are  used  to  generate  this  listing.  PRTSTMT  is 
used  to  list  each  data  descriotion  statement  and  is  called 
by  the  procedure  PRTMED.  The  oiocedute  PRTASSX  is  used  to 
list  all  the  assertions  in  the  MODEL  specification. 
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10.4  The  PL_ecedj?n_ce  ^iati_ix  Reoot  t • 

This  Lepotts  consists  of  a listinq  of  all  the 
relationships  in  the  Precedence  Matrix.  The  listinq  is 
not  in  the  form  of  a matrix,  but  in  the  form  of  a list.  A 
list  of  all  predecessor  nodes  and  the  successor  nodes  of 
each  entry  in  the  orecedence  matrix  is  outout  on  each 
line.  The  actual  relationship  itself  (that  is,  the 
precedence  matrix  entry)  follows  each  oredecessor  and 
successor  node,  by  aoproot iately  enclosinq  it  in 
parantheses. 

The  Precedence  Matrix  Report  for  the  RUM-1  example  is 
shown  in  Fiqure  10.5.  This  report  is  outout  to  the  file 
SYSPRINT,  and  the  Procedure  PPTADJL  is  used  for  qeneratinq 
this  report.  If  recruited,  the  matrix  form  of  the  report 
can  also  be  obtained  usinq  the  procedure  PRTAOJM.  When 
the  number  of  rows  in  the  orecedence  matrix  exceed  50,  it 
is  easy  to  understand  the  List  form  of  the  report  than  the 
matrix  form,  and  therefore,  only  the  list  form  of  the 
Precedence  Matrix  is  illustrated  here. 


Precedence  Matrix  Report  for  the  MODEL  Specification 
in  Fiftjre  4.10. 
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10.5  Syntax  Fr^or  Reno  it  . 

This  error  teooit  is  qenerated  durinq  the  syntax 
analysis  ohase  and  contains  the  syntax  errors  durinq  the 
syntax  analysis  (Phase  -1).  No  syntax  errors  were 
detected  for  the  RUN-1  soecif ication  in  Fiqure  4.5. 
Therefore,  to  illustrate  this  error  reoort,  considei  the 
set  of  statements  oiven  in  Fiqure  10.6.  Four  syntax 
errors  weie  detected  for  this  example,  and  are  shown  in 
the  error  listinq  in  Fiqure  10.7. 

Note  that  whenever  a syntax  error  is  detected,  the 
complete  statement  in  which  the  error  was  detected  is 
discarded.  The  end  of  statement  delimitor  is  used  for 
this  purpose.  An  an  examole,  the  second  error  in  Fiqure 
10.5,  has  discarded  the  statement  (line  40)  till  the 
semicolon.  However,  the  third  eriot  (in  Fiqure  10.5),  has 
deleted  the  whole  line  (line  50),  because  it  could  not 
find  any  semicolon  in  that  line. 

The  syntax  error  teooit  is  outout  to  the  file  SAPRRR, 
and  the  error  teooit  is  qenerated  in  the  routine  $POPF. 


OOOOOOHC4M*  • * 

oooooooooo • # 
oooooooooo#  * 

33333003tJ0i  • 

oooooooc.  o o • * 

OOOOOOO-SOO*  * 

oooooooooo* • 
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10.6  Semant ic  Error 

This  report  is  generated  in  Phase-3  of  the  MODEL 
Processor.  This  reoort  contains  errors  due  to 
inconsistency  and  circularity  in  the  MODEL  specification. 
Again,  in  the  RUN-1  specification  of  Figure  4.5,  no 
semantic  errors  were  detectd.  Therefore,  to  illustrate 
this  error  report,  consider  the  example  shown  in  Fiqure 
10.3.  The  assertion  in  this  example  is  inconsistent  for 
the  reason  mentioned  in  the  corresponding  error  report  of 
Fiqure  10.9.  Note  that,  if  the  user  had  not  specified  the 
assertion  at  all,  then  the  system  would  have  generated  an 
entirely  different  assertion  (using  the  reduction  function 
ANY) , which  is  given  by: 

OA.C  =*  ANY(A.C)  ; 

In  this  case,  the  subscripts  on  the  two  sides  of  the 
assertion  are  consistent. 

The  semantic  error  report  is  outout  to  the  file  ERROR 
(circularity  errors  are  output  to  the  file  ADJMRPT,  see 
GAN78 ) , and  the  procedure  WERROR  is  used  to  outout  a 
message  to  the  file  ERROR. 
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10.7  Warnings . 

Warnings  aie  generated  throughout  the  first  four 
ohases  of  the  MODEL  Processor.  Whenever  an  assertion  or  a 
DD  statement  is  changed  (or  generated)  a watninq  message 
is  sent.  In  addition,  some  form  of  redundencies  in  the 
specification  (like,  subscript  redundency)  is  also 
reported  as  warnings.  The  warning  renort  for  the  RUN-1 
example  in  Fiqure  4.5  is  shown  in  Fiqure  10.10. 

The  warning  messages  are  output  to  the  file  WARNING. 

The  procedure  wwarn  is  used  to  output  a message  to  the 


WARNING  file. 
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CHAPTER  11 


CONCLUSIONS 


This  Chaotei  briefly  describes  the  last  two  phases  of 
the  MODEL  Processor,  namely,  Sequencing  and  Code 
Generation  Phases.  The  desiqn  and  implementation  of  these 
two  phases  ate  not  oart  of  this  dissertation,  howevei , the 
techniques  used  in  the  earlier  version  of  the  system 
( PRY76A)  can  be  used  in  the  desiqn  of  these  phases.  The 
Chapter  also  summarizes  the  conclusions  that  can  be  drawn 
based  on  the  research  described  in  the  dissertation. 


11.1  Sequencing  and  Optimization , 


This  phase  of  the  MODEL  Processor  determines  the 
sequence  of  execution  of  all  events  implied  by  the  MODEL 
specification  using  conventional  Gtaoh  Theory  techniques. 
The  innut  to  this  ohase  is  a cycle  free  directed  qraph 
obtained  from  the  previous  phase.  If  there  are  cycles, 
they  are  reported  to  the  user  in  the  procedure  CYCLES 
described  in  the  previous  chanter  (and  in  GAN78). 

The  sequencing  phase  also  performs  the  scone  and 
iteration  analysis.  Therefore,  the  result  of  this  phase 
is  a set  of  data  structures  representing  the  desired 
sequence  of  Processes  and  flow  of  events,  sequenced  and 
tanked  in  their  order  of  execution.  The  recursions 
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involved  must  also  be  scheduled,  usinq  the  Procedure 
SCHEDUL  described  in  the  previous  chanter. 

There  is  another  function  oei formed  by  the  sequencing 
phase.  That  is,  the  optimization  of  the  sequence  and  the 
data  structures  in  the  sequence.  The  optimization 
techniques  can  be  categorized  into  three  tynes:  (1)  Memory 
optimization,  (2)  Execution  time  optimization  and  (3)  I/O 
Channel  time  optimization.  These  are  briefly  described  in 
the  following. 


11.1.1  Memory  Optimization. 


This  generally  means  generating  optimized  internal 
storage  structure  that  reduces  memorv  requirements.  For 
example,  for  the  following  set  of  statements: 

F ( 1 > * 1; 

FOR  N = 2 TO  100  F (N)  = F(N-1)*N; 

B * F (10)  ; 

C * F(100)  ; 

the  completeness  analysis  process  would  have  generated  a 
data  description  statement  for  P,  which  is  given  by: 

F IS  INTERIM ( (100) ) ; 

and  the  directed  graph  for  the  above  set  of  assertions 
will  be  as  shown  in  Figure  11.1.  For  this  graph,  we  can 
obtain  the  following  schedule  (using  PL/1  notation): 
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F(l)  = 1; 

DO  N = 2 TO  100; 

F (N)  = F(N-1)*N; 

IF  N = 10  THEN  DO; 

IB  = F ( N ) ; 

END; 

IF  N = 100  THEN  DO; 

C = F ( N ) ; 

END; 

END; 

The  above  schedule  can  be  simplified  as  shown  below; 
F = 1; 

DO  N = 2 TO  100; 

F = F * N; 

IF  N = 10  THEN  DO; 

B = F; 

END; 

IF  N = 100  THEN  DO; 

C = F; 

END; 

END; 


As  can  be  seen  from  the  simolified  schedule  above,  the 
aiiay  F can  be  teolaced  by  a scalar.  Such  tyne  of 
optimization  can  be  apolied  to  every  data  name  to  reduce 
the  required  memory  space.  In  some  cases,  more  than  one 
buffer  may  be  needed,  and  in  some  other  cases,  it  may  be 
advantageous  to  represent  iterations  in  the  qraph  by 
functions  as  described  later  in  this  chapter. 

Similar  optimization  technique  can  be  applied  to  the 
input  and  output  data  structures.  For  example,  if  a file 
is  defined  as  consistino  of  100  records  by  the  followinq 
statement : 


INREC  IS  RECORD ( INPUT  FILE, (100)); 


t 

I 

I 

\ 
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then  it  is  not  necessary  to  represent  the  record  as  an 
array  of  100  elements,  ''lost  of  the  times,  it  is  enouqh  to 
represent  only  one  instance  of  such  set  of  records  by 
allocating  space  for  only  the  first  record  and  overlayina 
the  other  records  on  the  first.  However,  this  is  not 
possible  if  the  contents  of  more  than  one  records  are  used 
as  source  of  any  assertion.  Consider,  for  examole,  the 
set  of  MODEL  statements  shown  in  Figure  11.2.  For  these 
set  of  statements,  only  the  I th  and  1+1  th  records  (in 
the  array  RECA)  are  needed  at  any  instant  of  time. 

Therefore,  if  the  structure  of  the  grouo  of  records  in  the 
INPUT  file  is  declared  by  the  following  PL/1  declarations: 

DCL  INPUT  FILE  RECORD  INPUT; 

DCL  1 GA  BASED ($PGA) , 

2 RECA, 

3 CUSTI  PIC  '99', 

3 AMOUNT  PIC  '999V99  * ; 

■i 


I '! 

4 


422 


I 

\ 

l 

| 

* 


INPUT  IS  FILE; 

GA  IS  GROUP( INPUT, (*) ) ; 

RECA  IS  RECORD (GA , (*) ) ; 

CUSTf  IS  FIELD(RECA,PIC  ’9T)  ; 

AMOUNT  IS  FIELD(RECA,PIC  ,9f)9VR':) ’)  ; 

OUTPUT  IS  FILE;  , 

RECB  IS  RECORD (OUTPUT , (*) ) ; 

CUST*  IS  FIELD(PEC(3,PIC '99  *)  ; 

AMOUNT  IS  FIE LD (RECB, PIC' 999V99') ; 

RECB. AMOUNT  = SUM (RECA. AMOUNT) ; 

IF  RECA.CUST#  (I)  = RECA . CUSTt ( I + 1 ) 

THEN  ENDGRP. RECA  = TRUE;  f 

ELSE  ENDGRP. RECA  = FALSE; 

* 


Fiqute  11.2  A Simole  MODEL  Soecif ication. 
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and  $NX_PGA  is  a pointer  to  the  next  record  (1+1  th 
record),  then  the  current  record  and  the  next  record  at 
any  instant  can  be  accessed  using  the  two  pointers  $PGA 
and  $NX_PGA  respectively.  In  this  case,  the  last 
assertion  in  Figure  11.2  must  be  denoted  by  the  following 
PL/1  statement: 

IF  RECA.CUST#  * $NX_PGA  ->  RECA.CUST# 

THEN  ENDGRP. RECA  * TRUE? 

ELSE  ENDGRP. RECA  * FALSE; 

This  technique  has  been  used  (for  restricted  cases)  in 
MODEL-I I (PRY76A). 

11.1.2  Execution  Time  Optimization. 

There  are  many  ways  of  optimizing  a program  to  reduce 
execution  time.  For  example,  transfer  of  an  expression 
from  a loop  will  sometimes  reduce  the  execution  time 
significantly.  For  example,  in  the  expression: 

B (*)  » C ( * ) * SQRT(N) ; 

the  function  SQRT  is  executed  for  each  index  value.  If 
the  above  expression  is  changed  to: 

TEMP  - SORT (N) ; 

B (*)  - C(*)  * TEMP; 

the  function  SORT  is  executed  only  once,  and  therefore. 
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can  save  significant  amount  of  execution  time.  The 
standard  optimization  techniques  used  in  compilers  like, 
PL/1  Optimize!  can  be  used. 

11.1.3  1/0  Channel  Time  Optimization. 

I/O  channel  time  overhead  of  anv  program  is  dependent 
on  the  type  of  file  organization.  For  examole,  the  user 
miqht  have  fixed  record  format  for  a file.  By  chanqinq 
the  record  format  to  fixed  blocked  tyoe,  the  number  of 
EXCPs  required  to  perform  the  I/O  operations  can  be 
reduced  siqnificantiy. 

11.2  Code  Generation . 

The  function  of  this  Phase  of  the  Processor  is  to 
generate  the  otoqram  in  the  tarqet  lanquaqe  (PL/I)  from 
the  optimized  flowchart  obtained  from  the  previous  phase. 
Because,  MODEL  is  a hiqher  lanquaae  than  the  tarqet 
lanquaqe  PL/I,  the  generation  of  code  from  the  flowchart 
is  not  straight  forward.  Some  of  the  additional 
processing  required  during  the  code  generation  are 
summarized  in  the  following: 

(1).  Structuring  of  the  files:  Sometimes  it  is  necessary 
to  restructure  a file  specified  in  MODEL  to  an 
acceptable  structure  in  the  target  language.  This 


restructuring  may  involve  packing  (in  case  of  target 

i 

files)  ot  unoackinq  (in  case  of  source  files)  of  the 
records  in  a file. 

(2) .  Handlinq  abonormal  terminations  (in  the  qenerated 

program)  like,  etroL  conditions,  end  of  file 
conditions,  and  so  on. 

(3) .  Handling  special  keywords  like,  ENDGRP,  SUBSET, 

ABSENT,  and  so  on,  used  in  the  MODEL  specification. 

Techniques  used  by  RIN  (RIN7R)  and  Pnueli  (PRY76A)  can 
be  extended  to  oerform  the  above  functions  of  the 
Processor.  Some  of  the  required  techniques  are  briefly 
described  in  the  following. 

11.2.1  Use  o£  Functions. 

Usually,  each  assertion  can  be  translated  into  a set  of 

target  language  statements  in  appropriate  sequence. 

However,  in  some  cases,  a set  of  assertions  may  have  to  be 

represented  by  a function  or  a procedure.  This  function 

can  even  be  recursive,  if  the  assertions  under 

consideration  define  the  elements  of  a data  name 

recursively.  For  example,  consider  the  following  set  of 

assertions : 

F(l)  - 1; 

F (N)  » F(N-1)*N; 

A - F(X); 

B - F ( Y)  j 
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In  this  case,  because  X and  Y ate  unknown,  it  is  mote 
i appropriate  to  treat  F as  a recursive  function,  and  not  as 

an  array.  (Note  that,  as  mentioned  in  the  earlier 
section,  F can  he  represented  by  a scalar  (or  an  array)  if 
X and  Y ate  both  constants.)  Generation  of  code  for 
recursive  otocedutes  from  the  directed  orach  can  be  auite 
easy  in  some  tarqet  lanquaoes  like  PL/1,  but  some 
additional  effort  may  be  required  while  using  some  other 
tarqet  lanquages  (like  COhOL) . 

11.2.2  Packing  and  Unpacking  Records. 

Packina  and  unpacking  of  records,  described  in  the 
beginning  of  this  section,  teauires  further  discussion. 

i 

If  a record  contains  a data  name  that  repeats  variable 
number  of  times  (or  the  length  of  that  data  name  varies), 

I 

then  in  both  MODEL-I  and  MODEL-II,  it  is  required  that  the 
maximum  allowed  dimension  (or  the  length)  for  that  data 

I I 

name  is  specified  by  the  user.  This  maximum  value  is  used 
' in  allocating  the  record,  and  eventually,  the  individual 

characters  in  the  actual  record  ate  unpacked  to  the 
allocated  record  (in  the  case  of  a source  file  record;  in 
the  case  of  a tarqet  file  record,  the  allocated  record  is 
packed  to  obtain  the  required  record  (RIN76)).  This 
packing  (and  unpacking)  is  illustrated  in  the  examoles  of 
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A IS  RECORD; 

N IS  F I ELD (A, PIC '99 ')  ; 

X IS  FIELD  (A,  (0:100)  , CHAR  ( 40 ) ) ; 

Y IS  FIELD (A, PIC' 999')  ; 

EXIST. X * N; 

Figute  11.3.  An  example  that  tequires 
Unpacking. 
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OCL  !3UF  CHAP  ( 4005  ) VAP; 

OCL  1 A, 

2 N PIC  '09  ' , 

2 X(100)  CHAR (40)  , 

2 Y PIC  *9999  * ; 

READ  FILE (FILENAME)  INTO(BUF); 

1 = 1; 

N = SUBSTR (BUF, I , 2)  ; 

1*1  + 2; 

EXIST. X = N; 

DO  J = 1 TO  EXIST. X; 

X(J)  = SUBS  TP ( 3UF ,1,40)  ; 

I = I + 40; 

END; 

Y * SUBSTR (BUF, I, 4)  ; 


Fiqure  11.4.  Unpackinn  an  Incut  Record. 


Fiqur es  11.3  and  11.4.  In  Fiqute  11.3,  the  record  A is 
shown  to  contain  a field,  X,  which  repeats  variable  number 
of  times  (at  most,  100).  And,  Fiqute  11.4  shows  the 
qenerated  declaration  for  the  record  and  the  unoackinq 
statements  that  ate  used  to  move  the  actual  record  (i.e., 
BUF)  into  the  declared  record,  A. 

It  is  not  necessary  to  oetform  the  tvoe  of  unoackinq 
(or  packinq),  shown  in  Fiqure  11.4,  in  all  cases.  For 
examole,  there  is  no  need  of  unoackinq  for  the 
specification  of  Fiqure  11.3,  if  the  record  is  decalred  as 
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shown  in  Fiqure  11.5.  I 

DCL  1 A, 

' 2 N PIC  '99', 

2 X (NM  REFER (N) ) CHAR(40), 

2 Y PIC  ’999’; 

Fiqure  11.5.  'Jsinq  REFER  option. 

In  the  declaration  of  Fiqure  11.5,  the  REFER  option  of 
PL/1  is  used  to  define  the  size  of  the  repeatinq  variable, 

X.  This  technique  can  be  used  in  most  cases.  However, 
even  in  PL/1  ootimizer,  the  use  of  refer  oDtion  is 
restricted,  and  therefore,  cannot  be  used  in  the  followinq 
cases : 

(1) .  When  the  variable  that  defines  the  repetition  (RHS 
arqument  of  the  refer  ODtion)  is  not  in  the  structure 
as  shown  in  the  example  of  Fiqure  11.6. 

(2) .  When  the  variable  that  defines  the  repetition  is  in 
the  same  structure,  but  physically  located  after  the 
repeating  variable  as  in  the  example  of  Figure  11.7. 

(3) .  When  the  variable  that  defines  the  repetition  is  an 
array  as  in  the  example  of  Fiqure  11.8. 


A IS  RECORD; 

B IS  FIELD(A, (N) ,CHAR(10) ) ; 
C IS  FIELD (A, CHAR (10) ) ; 


Fiquie  11.6. 


A IS  RECORD; 

B IS  FI  ELD (A , (N) , CHAR ( 10 ) ) ; 
N IS  FIELD(A,PIC '99 ') ; 


Fiquie  11.7. 


A IS  RECORD; 

B IS  GROUP ( A , (*) ) ; 

C IS  FIELD(B,PIC'99') ; 

D IS  FIELD(B, (*) ,CHAR<10) ) ; 


EXIST. D(I)  * C ( T ) ; 


Fiquie  11.8. 


I 

I 
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DCL  1 ft  BASED ( $AP) , 

2 $N  PIC  ’99', 

2 B (XN  REFER ( $N)  ) CHAR(IO), 

2 C CHAR.  (10)  ; 

DCL  BUF  CHAR (MAXSIZE ) VAR  BASED (P)  ; 

DCL  BUF 1 CHAR ( MAXSIZE)  VAR  BASED ( SAP) ; 

ALLOCATE  BUF; 

UNSPEC ( SAP)  * UNSPEC(P)  + 2; 

/*NOW  READ  THE  FILE*/ 

READ  FILE  (FILENAME)  INTO(RUFl); 

$N  * N; 


Fiquie  11.9.  PL/1  code  foi  the  exaniDle  in 
Fiqute  11.6. 
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DCL  1 A 3ASE0 ( SAP)  , 

2 $N  PIC 

2 R (XN  REFER ($N))  CHAP  (10), 

2 N PIC  '99';  : 

DCL  BUF  CHAR (MAXSIZE)  VAR  BASED ( P) ; 

DCL  BUF1 (MAXSIZE)  VAR  BASED (SAP) ; 

ALLOCATE  BUF; 

UNSPEC ( SAP)  = UNSPEC(P)  + 2; 

/♦READ  THE  FILE  VOW*/ 

READ  FILE (FILENAME)  INTO(BUFl); 

$N  = SUBSTR (BUFl , LENGTH (BUFl)— 1,2)  • 

Fiqute  11,20.  PL/1  coHe  for  the  example 
in  Fiqure  11.7. 


r 
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In  the  fitst  two  cases,  the  file  structure  can  be  expanded 
slightly  by  including  the  data  names  that  define  the 
dimensions  (of  the  repeating  data  names)  in  the  beginning 
of  the  declared  PL/1  structure.  Then,  the  refer  ootion 
can  be  used  as  usual.  For  example,  the  PL/1  codes  for 
reading  the  records  in  the  examples  of  Figures  11.6  and 
11.7  are  shown  in  Figures  11.9  and  11.10  respectively. 

However,  in  the  third  case,  it  is  probably  easier  to 
use  the  unpacking  and  oacking  technique  as  done  in  MODEL- I 
and  MODEL-I I . 

If  the  target  languaqe  does  not  provide  such  facilities 
as  the  refer  option  mentioned  above,  then  packinq  and 

unpacking  of  data  is  necessarv.  Note  that  the  amount  of 

\ 

restrictions  imposed  by  PL/l-F  (if  used  as  a tarqet 
language)  on  the  use  of  refer  oDtion  is  more  than  that  of 
PL/1  Optimizer. 

11.3  Conclusions.  '* 

■ » 

In  the  previous  chapters,  the  important  features  of  the 
MODEL  Processor  were  described.  MODEL  is  described  as  a 
simple  non-procedural  lanquage  in  which  a non-proqrammer 
can  specify  his  data  processing  requirements  easily.  The 
lanquage  has  the  power  of  very  high  level  lanquaqes  like 
APL,  where  array  operations  can  be  expressed  without  using 


subscripts.  The  importance  of  such  ability  is  not  in 
exotessinq  them,  but  in  analyzing  such  operations  for 
completeness  and  consistency.  The  array  qraoh 
representation  described  in  Chapter  9,  has  served  as  an 
excellent  tool  for  the  analysis  of  iterations  and 
recursions  in  the  MODEL  specification. 

11.3.1  Limitations  of  the  Implementation . 

Even  thouqh  some  criteria  for  solittinq  assertions  were 
described  in  Chanter  thev  were  not  implemented  in  the 
MODEL  Processor.  This  is  because  of  lack  of  efficient 
alqorithms  to  perform  those  solittinqs  (or 
decompositions).  Besides,  we  were  not  sure  if  we  want  to 
make  the  system  too  powerful,  while  advocatinq  its  use  for 
untrained  users . 

Also,  we  have  restricted  ourselves  to  the  analysis  of 
recursions  on  only  one  dimension.  Consistency  analysis  of 
recursive  specifications  on  more  than  one  dimensional 
arrays  has  not  been  tackled  fully,  and  is  a tooic  for 
further  research. 

Even  in  sinqle  dimension  case,  we  have  restricted 
ourselves  to  linear  subscript  expressions.  Only  with  this 
restriction,  could  we  arrive  at  the  conditions  for 
consistency  and  seauenceabil ity  (described  in  Chapter  9). 


} 
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Analysis  of  a qeneial  case  is  quite  comolicated.  Even  in 
the  simple  linear  case,  we  have  restricted  the  range  to  a 

finite  domain  (lather  than  infinite  domain,  which  was 

i 

suggested  by  Dr.  Pnueli).  Extension  of  the  results  of 
finite  domain  to  infinite  domain  may  also  be  quite 
interesting. 

It  was  our  experience  that  an  untrained  user  has  the 
most  difficulty  in  specifying  Drooer  subscripts  for 
iterative  specifications  of  a problem.  Therefore,  it  was 
quite  important  for  us  to  accept  subscript-less 
specification.  If  the  subscripts  are  not  specified,  they 
are  automatically  supplied,  and  if  no  ranqe  for  a 
subscript  is  specified,  then  an  appropriate  range  (usually 
from  the  data  description  statement  of  the  data  name  with 
which  the  subscript  is  associated)  is  supplied.  However, 
cate  must  be  used  in  spocifyino  complex  ooerations  like, 
outer  product.  For  example,  if  3 is  a matrix,  and  A is  a 
vector,  then  there  is  no  way  of  expressing  the  relation: 

V i , j B ( I , J ) - A ( I ) * A ( J ) 


I 
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using  oui  subset iot-less  lanquaqe  (at  least  in  the  current 
implementation)*.  Of  course,  this  tyne  of  problem  is 
resolved  in  hiqher  languages,  like  APL,  bv  using  soecial 
operators. 

11.3.2  Interpretation  of  Assertions. 

Assertion  solittinq  is  not  really  necessary  if  we  are 
willing  to  "interpret"  the  assertions  in  the  loon  which  is 
found  to  be  unschedulable.  This  may  lead  to  inefficient 
programs  and  reporting  of  some  of  the  inconsistencies  may 
have  to  be  delayed  until  the  execution  of  the  generated 
program.  For  example,  for  the  set  of  assertions  given  in 
Figure  9.14,  we  can  obtain  the  PL/1  program  shown  in 
Figure  11.11. 


* If  the  assertion: 

9 = A * Aj 

is  specified  in  ^ODFL , where  9 is  a matrix  and  A is  a 
vector,  then  the  assertion  is  changed  to  the  following: 


9(1, J)  - A (3)  * A(J) 


I 


OCL  S$A  ( 100)  EIT(l)  1NIT ( (100)  *0 '3)  ; 

DCL  $ASW  BIT ( 1 ) INIT (*1’3); 

DO  WHILE  ( $ASW) ; 

$ASW  = '0 'b; 

Al:  /*  ASSERTION:  A1  */ 

IF  $$ A ( 1 ) THEN  DO; 

A ( 1 ) = 0;  \ 

\ 

$ASW  = '1 *3;  ‘ 

J 

S$A  ( 1 ) = 'l'B;  1 

END;  i 

A2 : /*  ASSERTION:  A2  */ 

DO  I = 1 TO  50; 

IF  $$A  (2*1)  THEN 

/*  IF  A (2*1)  MOT  COMPUTED*/  \* 

IF  SSA(I)  THEN  DO; 

/*  AND  A ( I ) IS  COMPUTED*/ 

A ( 2* I ) = A ( I ) +1 ; ‘ 

$ASW  * '1  '3; 

$SA(2*I)  * 'l'B; 

END; 

END;  /*DO  I*/ 


Figure  11.11. 
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/*  ASSERTION:  A3  */ 

00  I = 31  TO  50; 

IF  $SA (2*1-1)  THEN 
/*  IF  A ( 2* 1-1 ) NOT  COMPUTED*/ 
IF  $$A ( 2* I ) THEN  DO; 

/*  AND  A ( 2* I ) IS  COMPUTED*/ 
A(2*I-1)  = A ( 2* I ) +2 ; 

SASW  = ’1 '«; 

$$A (2* 1-1 ) = •l'B; 


END;  /*D0  I*/ 


A3:  /*  ASSERTION:  A4  */ 

DO  I = 1 TO  29; 

IF  $$A(2*I+1)  THEN 

/*  IF  A ( 2* 1+1 ) NOT  COMPUTED*/ 

IF  $$A ( 2* 1+5 ) THEN  DO; 

/*  AND  A (2*1+5)  IS  COMPUTED*/ 
A ( 2*1  + 4 ) = A ( 2*1  + 5) +1 ; 

SASW  = 'l'B; 

$$A (2* 1+1 ) = 'l'B; 


END;  /*D0  I*/ 
END;  /*ENO  DO  WHILE*/ 


Figure  11.11, 


I 
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( 

I 
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r 

/♦NOW  CHECK  FOR  COMPLETENESS  AND  CONSISTENCY*/ 

I 

IF  ANY ( $$A)  THEN  DO; 

t 

/♦SOME  ELEMENTS  ARE  HOT  COMPUTABLE 
SEND  AN  ERROR  MESSAGE*/ 

END; 


Figure  11.11.  A PL/1  Program  for  the  Assertions 
in  Figure  9.14. 
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In  the  Ptoqram  of  Figure  11.11,  an  array  of  switches 
($$A)  is  associated  with  the  array  A.  An  element  of  $$A 
is  set  if  the  corresponding  element  in  A is  comDuted.  The 
aopl icabi 1 ity  of  each  assertion  to  comoute  an  element  is 
tested  repeatedly  till  no  more  computation  is  oossible 
( $ASW  = '0')»  or  till  all  the  elements  ate  computed. 

The  interpretation  technique  described  above  is  auite 
useful  because  it  can  be  aoolied  not  only  to  unschedulable 
loops,  but  also  to  the  loops  containina  nodes  whose  bounds 
are  non-integers.  However,  as  can  be  seen  from  the 
program  of  Figure  11.11,  it  may  qenerate  inefficient 
programs.  In  some  cases,  a better  solution  can  be 
obtained  by  moving  paLt  of  interpretation  (durinq  the 
execution  phase)  to  the  consistency  analysis  phase. 
Consider,  again,  the  set  of  assertions  in  Figure  9.14.  We 
can  move  all  the  interpretations  (implied  by  the  program 
in  Figure  11.11)  to  the  analysis  phase,  and  also,  durinq 
the  interpretation  we  can  obtain  an  ordered  set  of 
elements  that  were  computed  using  each  assertion,  in  each 
pass  in  the  "DO  WHILE"  loop.  For  the  example  under 
consideration,  we  can  see  that  during  the  first  pass  we 
can  comoute  the  elements: 

1 using  Al, 

2,4,8,16,32,64  using  A2, 
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63  usinq  A3  and 

59,55,51,47,43,39,35,31,27, 

23,19,15,11,7,3  usinq  A4. 

During  the  second  oass,  we  can  compute  the  elements: 

6,12,14,22,24,28,30,38,44,46, 

48,54,56,60,62,70,76,78,86, 

88,92,94,96  usinq  A2, 

61,69,75,77,85,87,91,93,95  usinq  A3  and 

57,53,49,45,41,37,33,29, 

25,21,17,13,9,5  usinq  A4. 

And,  finally,  during  the  third  Dass,  we  can  comoute 
the  elements: 

10,18,20,26,34,36,40,42,50,52, 

58,66,68,72,74,80,82,84,90,98, 

100  usinq  A2  and 

65,67,71,73,79,81,83,89,97,99  usinq  A3. 

From  the  above  set  of  numbers,  we  can  qenetate 
though,  it  is  not  trivial)  the  required  PL/1  Dtoqram, 
of  which  is  shown  in  Fioure  11.12. 
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/*  PASS  1*/ 

) /‘ASSERTION:  A1‘/ 

A(l)  = 0; 

/‘ASSERTION:  A2*/ 

I = 1; 

DO  WHILE  ( I<50) ; 

A ( 2* I ) = A ( I ) +1; 

I = 2*1; 

END; 

/‘ASSERTION;  A3*/ 

A ( 63)  = A ( 64 ) + 2; 

. /‘ASSERTION:  A4*/ 

DO  I = 29  TO  1 PY  -2; 

A ( 2‘ 1+1 ) = A ( 2*1+5 ) +4 ; 

END; 

/*  PASS  2*/ 

/‘ASSERTION:  A2*/ 

DO  I = 3 TO  47  BY  4; 

A (2*1)  * A ( I ) + 1; 

END; 

► 

DO  I * 6 TO  46  BY  9; 

A (2* I)  * A (I)  +1; 


( 

l 

r 

f 


) 


Figure  11.12. 


443 


I 

l 

| 

f 

l 

r 

END;  v I 

00  I = 12  TO  44  3Y  16; 

A (2*  I)  = A (I)  + 1; 

f 

END; 

A (48)  = A ( 24 ) + 1; 

A ( 96 ) * A ( 4 8 ) + 1; 

/♦ASSERTION;  A3*/ 
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Note  that  the  ptooram  in  Fiquie  11.12  is  lonqer  than 
that  in  Fiqure  11.11  because  of  non-linear  nature  of  the 
computations.  However,  the  program  in  Fioure  11.12  does 
not  use  any  additional  array  of  switches  (like  $$A  in 
Fiqure  11.11).  To  obtain  the  oroqram  in  Fiqure  11.12,  we 
need  a set  of  alqorithms  to  solit  a set  of  elements 
obtained  in  any  pass  into  subsets  of  elements  that  can  be 
represented  by  a regular  do  looo.  Development  of  such 
alqorithins,  or  better  techniques  than  the  ones  described 
above  needs  further  research.  Note  that  the  program 
qenerated  for  the  qtanh  is  Fiqure  9.14  from  the  assertion 
splitting  procedures  described  in  Chaoter  9 will  be  almost 
similar  to  the  Dtoqtam  shown  in  Figure  11.12. 

11.3.3  on  *^3f}“mach ine  Interface. 

There  is  one  additional  drawback  with  the  current 
implementation.  Except  during  svntax  analysis  phase,  a 
detected  error  is  teoorted  to  the  user,  but  no  dialoque 
between  the  user  and  the  Processor  is  allowed.  In  other 
words,  the  user  cannot  respond  to  any  of  the  error 
reports.  Durinq  the  syntax  analysis,  however,  an  error  is 
reported  to  the  user  as  soon  as  it  is  detected,  and  the 
user  can  immediately  reenter  the  line,  thus  rectifying  the 
enot.  Even  such  a dialoque  is  not  too  good,  and  a "true" 
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I 

P 

automatic  ptoqtamminq  system  should  have  the  facility  to 
invoke  an  editor.  Provisions  must  also  be  provided  to 
process  user  responses  in  the  later  phases  of  the  MODEL 

r 

processor,  otherwise,  debuqqinq  a specification  can  be 
quite  laborious  and  inefficient. 

Despite  the  restrictions  mentioned  above,  the  analysis 
f.  techniques  described  in  this  dissertation  have  qreatly 

reduced  the  effort  requited  by  the  user  for  specifyinq  a 
• problem,  and  is  a biq  step  in  the  automation  of  software 

development  process.  Even  thouqh  much  research  in  this 
field  lies  ahead  (as  can  be  seen  from  the  above 
observations) , we  feel  that  this  research  has  made  a biq 
contribution  towards  achievinq  a simpler  man-machine 
interface. 
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APPENOIY  A i 

I 

In  this  aooendix,  a brief  description  of  each 
subroutine  used  in  MODEL  Processor  is  oiven.  The  first 
column  is  the  otoceduie  name  oi  entry  Doint  name.  Second 
column  indicates  tne  procedure  name  in  which  the  entry 
name  in  column  1 is  located.  Third  column  indicates  the 
arquments  of  the  orocedure  or  entry  ooint.  Standard 
conventions  are  used  to  indicate  the  attributes  of  the 
arguments.  Each  argument  is  Prefixed  by  a number  or 
1 th  argument  is  prefixed  by  the  number  "i".  If  the 
procedure  is  used  as  a function  (using  the  returns 
attribute),  then  the  return  attributes  are  prefixed  by 
Column  4 indicates  all  the  routines  that  the 
procedure  calls.  *<Jote  that  this  column  is  empty  if  the 

first  column  represents  an  entry  ooint.  Fifth  column  r 

indicates  the  routines  that  use  the  procedure  or  the  entry 

ooint.  And,  finally,  the  last  column  oives  a brief 

description  of  the  functions  of  the  orocedure  or  entry 

point. 
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ADoenlix  n 

arapendix  lists  the  information  disolayed  by  each 
the  HF1LP  command.  The  usaqe  of  each  command  is 
by  the  entries  in  lowet  case  letters. 
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help  model 


FOLLOWING  TYPES  OF  STATEMENTS  ARE  ACCEPTED  BY  MODEL: 

(1) .  MODULE  STATEMENT  (MODULE). 

(2) .  SOURCE  FILE  STATEMENT  (SOURCEF). 

(3) .  TARGET  FILE  STATEMENT  (TARGETF) . 

(4) .  REFER  STATEMENT  (REFER). 

(5) .  MEDIA  DESCRIPTION  STATEMENT  (MEDIA). 

(6) .  FILE  DESCRIPTION  STATEMENT  (FILE). 

(7) .  RECORD  DESCRIPTION  STATEMENT  (RECORD). 

(8) ,.  GROUP  DESCRIPTION  STATEMENT  (GROUP). 

(9) .  FIELD  DESCRIPTION  STATEMENT  (FIELD). 

(10) .  INTERIM  DESCRIPTION  STATEMENT  (INTERIM). 

(11) .  SUBSCRIPT  DESCRIPTION  STATEMENT  (SUBSCRIPT). 

(12) .  ASSERTIONS  (ASSERTION). 

TO  DISPLAY  THE  SYNTAX  AND  DESCRIPTION  OF  ANY  STATEMENT  ENTER: 

HELP  <STMT-TYPG>  WHERE  <STMT-TYPE>  IS  THE  STATEMENT  TYPE,  WHICH 
IS  ENCLOSED  IN  PARENTHESES  IN  THE  ABOVE  LIST. 

FOR  EXAMPLE,  THE  FOLLOWING  STATEMENT: 

HELP  FIELD  WILL  DESCRIBE  THE  SYNTAX  AND  THE  DESCRIPTION  OF 
FIELD  STATEMENT. 


help  module 


PURPOSE:  TO  GIVE  A NAME  TO  THE  DESIRED  MODULE.  SYNTAX: 
<MODULE-STATEMENT>: :«  <NAME>  IS  MODULE} 


help  soutcef 


PURPOSE:  TO  INDICATE  THE  NAMES  OF  THOSE  FILES  WHICH  ARE  SOURCE  OR 
INPUT  TO  THE  DESIRED  MODULE.  SYNTAX: 

<SOURCE-FILE-STATEMEIIT>:  :•  SOURCE  ( FILE  ) : 

<FILENAME>  (,  <FILENAME>  ]*  WHERE  <FILENAME>  IS  THE 
NAME  OF  A FILE. 


help  tergetf 


PURPOSE:  TO  INDICATF  THE  IIA.ME  OF  THOSE  FILES  WHICH  ARE  TARGET  OR 
OUTPUT  TO  THE  DESIRED  MODULE.  SYNTAX: 

<TARGET-FILE-STATE.MENT>:  :•  TARGET  I FILE  ) : 
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<FILENAME>  (,  <FILENAM£>  ]*  WHERE  <FILENAME>  IS  THE 
NAME  OF  A FILE. 


help  lefei 


PURPOSE:  TO  INCLUDE  A SPECIFIC  SECTION  OF  A MODULE  IN  THE  LIBRARY. 
SYNTAX: 

<REFER-STATEMENT> : :*  REFER  : <MODULE-NAME>  <SECTION-NAME> 

WHERE-  <SECTION-NAME>  IS  THE  NAME  OF  THE  SECTION  (IN  THE  MODULE 
SPECIFIED  BY  <MODULF-NAMF> ) WHICH  IS  TO  BE  INCLUDED  IN 
THE  MODEL  SPECIFICATION. 


help  help 


"HELP"  IS  A TEACHING  AID  PROVIDED  IN  MODEL.  IT  DESCRIBES  THE 
ALLOWED  STATEMENTS  IN  MODEL  AND  THE  CORRESPONDING  SYNTAX.  IT  ALSO 
GIVES  A BRIEF  DESCRIPTION  OF  THE  ERROR  CODES  DISPLAYED  DURING  SYNTAX 
ANALYSIS.  THERE  ARE  ESSENTIALLY  FOUR  USES  OF  HELP: 

(1) .  HELP  MODEL 

THIS  USE  DISPLAYS  ALL  THE  ALLOWED  STATEMENTS  IN  MODEL. 

(2) .  HELP  SYNTAX 

THIS  USE  DISPLAYS  THE  CONVENTIONS  USED  IN  THE  SYNTAX  OF 

MODEL  STATEMENTS. 

(3) .  HELP  <STATEMENT-TYPE> 

THIS  USE  DISPLAYS  THE  SYNTAX  AND  DESCRIPTION  OF  A 

PARTICULAR  TYPE  OF  STATEMENT  SPECIFIED  BY  <STATEMENT-TYPE> . 

(4) .  HELP  <ERROR-CODE> 

THIS  USE  DISPLAYS  A SHOPT  DESCRIPTION  OF  A PARTICULAR 

TYPE  OF  ERROR  SPECIFIED  BY  <ERROR-CODE> . 

IF  NO  ARGUMENT  IS  SPECIFIED  IN  THE  "HELP"  STATEMENT,  THEN  A SHORT 
DESCRIPTION  OF  THE  MOST  RECENT  SYTAX  ERROR  ENCOUNTERED  DURING  THE 
SYNTAX  ANALYSIS  IS  DISPLAYED.  IF  NO  ERROR  WAS  ENCOUNTERED,  THEN 
•HELP  MODEL"  IS  ASSUMED. 


help  syntax 


IN  THE  DESCRIPTION  OF  THE  SYNTAX  OF  MODEL  STATEMENTS,  THE 
FOLLOWING  CONVENTIONS  WILL  BE  USED: 
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(1) .  ITEMS  INSIDE  SOUARE  BRACKETS  ARE  OPTIONAL. 

(2) .  IF  AN  ASTERISK  FOLLOWS  A RIGHT  SQLAPE  BRACKET, 

THEN  THE  ITEM  INSIDE  THE  SOUARE  BRACKET  CAN  REPEAT 
ZERO  OR  MORE  TIMES. 

(3) .  { X I Y ) - INDICATES  THE  SELECTION  OF  EITHER  X OR  Y. 

(4) .  <X>  REFERES  TO  A GENERIC  CLASS  OF  ITEMS  FOR  WHICH  A SPECIFIC 

ITEM  NEEDS  TO  BE  SUBSTITUTED. 

(5) .  ALL  OTHER  WORDS  REFER  TO  SPECIFIC  MODEL  VOCABULARY. 

BUT  FOR  THE  CONVENTIONS  (1"),  (2)  AND  (3),  THE  SYNTAX  USES  REGULAR 

BNF  NOTATION. 


help  media 


PURPOSE:  DESCRIBES  THE  MEDIUM  IN  WHICH  THE  SOURCE  OR  TARGET 
FILE  RESIDES.  SYNTAX: 

<MEDIA-STATEMENT> : : ■ <NAME>  IS  MEDIA 

( (<APGUMFNT>f , (ARGUMENT*) *) 1 WHERE- 

<ARGUMENT>  HAS  THE  FOLLOWING  SYNTAX: 

(ARGUMENT*: :•  BLOCKSIZE  » < INTEGER* 

I RECORDSIZE  « < INTEGER* 

I ORG  - (SAM  | ISAM  I DIRECTl 
I RECFM  - {FIXED  I FB  | VARIABLE  I 
V3  I VS  I UNDEFINED) 

I UNIT  - { TAPE  I CARD  | PUNCH  I DISK  | 
TERMINAL  ) 

I DISP  « { OLD  | SHR  | NEW  I MOD  } 

I TLABEL  - { SL  | NL  I BLP  ) 

I PARITY  ■ { ODD  I EVEN  } 

I CODE  - { EBCDIC  I BCD  | ASCII  ! 

I TRACKS  -(7|9) 

I DENSITY  - { 200  | 506  I 800  | 1600  ) 

I IL  - <NAME> 

I EL  - <NAME> 

I TAB  ■ ( < INTEGER*  {,< INTEGER* J * ) 

I PAGESIZE  - < INTEGER > 

I LINESIZE  - < INTEGER* 


help  file 


PURPOSE:  TO  DESCRIBE  A FILE  AND  ITS  ATTRIBUTES.  SYNTAX: 
<FILE-STATEMENT>: <NAME>  IS  FILE  { ( (PARENT-NAME* 

( , <ARGUMEMT> 1 * ) ) WHERE-  THE  < ARGUMENT*  HAS  THE 

FOLLOWING  SYNTAX: 

(ARGUMENT*: :•  BLOCKSIZE  - <INTEGER> 

I RECORDSIZE  • (INTEGER* 

I ORG  > (SAM  I ISAM  I DIRECT) 

I RECFM  ■ (FIXED  | FB  I VARIABLE  I 
VS  | VS  I UNDEFINED) 

I UNIT  - ( TAPE  | CARD  | PUNCH  | OISK  | 
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TERMINAL  } 

OISP  = { OLD  | SHR  I NEW  t MOD  } 

TLABEL  = { SL  I NL  | BLP  ) 

PARITY  * { ODD  | EVEN  } 

CODE  =*  ( E3C0IC  I BCD  I ASCII  ) 

TRACKS  ={7|9> 

DENSITY  «=  { 200  I 556  I 800  I 1600  } 

IL  = <NAME> 

EL  = <NAME> 

TAP  = ( < I NTEGER)  ( , < INTEGER) ] * ) 

PAGES  I ZE  = <INTEGFR> 

LINES IZE  = < INTEGER) 

KEY  = <KEY-HAM2> 

DSN  = < DATASET- NAME) 

FILE*  = < INTEGER) 

SPACE  = ( {TPK  | CYL  | < I NTEGER  > } 

[,  [ <PRIMARY>  1 f , KSSCnjDARY)] 

{ , ( <DI RECTORY) ) ] ] 

WHERE  <PRIMARY>,  <SECONOARY>  AND  <DI»ECTORY>  ARE  INTEGER  CONSTANTS 
THAT  SPECIFY  THE  UNITS  REQUIRED  FOR  PRIMARY  EXTENT,  SECONDARY  EXTENT 
AND  THE  DIRECTORY  OF  THE  DATASET  RESPECTIVELY.  ALSO,  THE 
<PARENT-NAME>  IN  THE  ABOVE  SYNTAX  SPECIFIES  THE  PARENT  MEDIA  NAME. 


help  record 


PURPOSE:  TO  DESCRIBE  A RECORD  OF  A FILE  AND  ITS  ATTRIBUTES.  SYNTAX: 
<RECORD-STATEMENT> : : = <N A M E > IS  RECOPD 

f (< PARENT-NAME) [ , < ARGUMENT)] * ) } WHERE-  <ARGUMENT>  HAS 
THE  FOLLOWING  SYNTAX: 

<ARGUMENT> : : ■ KEY  « <KEY-NA'1E> 

I ( <S I ZE)  { , <SIZS> | * ) WHERE-  <KEY-NAME>  IS  THE  NAME  OF 
THE  KEY  FIELD  AND  <SIZE>  IS  A 

NAME  OR  INTEGER  CONSTANT  THAT  SPECIFIES  THE  SIZE  OF  A 
DIMENSION  OF  THE  RECORD. 


help  group 


PURPOSE:  TO  DESCRIBE  A GROUP  IN  A FILE  OR  IN  A RECORD  OF  A FILE. 
SYNTAX: 

<GROUP-STATEMENT>: :=  <NAME>  IS  GROUP 

| (<PARENT-NAME>  f , <ARGUMENT>) *) ) WHERE-  <ARCUMENT>  HAS 
THE  FOLLOWING  SYNTAX: 

<ARGUMENT>: :»  KEY  ■ <KEY-NAME> 

I (<SIZE>  ( , <SI ZE) 1 * ) WHERE-  <KEY-NAME>  IS  THE  NAME  OF 
THE  KEY  FIELD  AND  <SIZE>  IS  A 

NAME  OR  INTEGER  CONSTANT  THAT  SPECIFIES  THE  SIZE  OF  A 
DIMENSION  OF  THE  GROUP. 


I 
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help  field 


PURPOSE:  TO  DESCRIBE  A FIELD  IN  A GROUP  OR  A RECORD  STRUCTURE  OF 
A FILE.  SYNTAX: 

<PIELQ-STATEMENT> : :=  <NAME?  IS  FIELD 

[ (<PARENT-NAME? f ,<ARGUMENT?1 *) 1 WHERE-  <ARGUMENT>  HAS 
THE  FOLLOWING  SYNTAX: 

XARGUMENT? : : * (<SIZE?  (,<SI7E?)*) 

I CHAR  [ ( <LENGTH>) 1 I VAR  J 
I BIT  ( ( XLENGTH? ) 1 |VAR) 

I BIN  (FIXED  | FLOAT  1 ( ( < PREC I5IQN?) 1 
I DEC  {FIXED  | FLOAT  I ( ( <PREC I S ION> )) 

I PIC  ( ^PICTURE-SPECIFICATION)  • ) WilERE- 
<SIZE>  AND  <LENGTH>  ARE  EITHER  INTEGER  CONSTANTS  OR  NAMES  THAT 
RESPECTIVELY  SPECIFY  THE  SIZE  OR  LENGTH  OF  THE  FIELD  NAME. 
< PICTURE-SPEC  I F ICATIOil?  SPECIFIES  THE  PICTURF  ATTRIBUTES  OF  THE 
FIELD  NAME  AND  THE  <P?ECISION?  FOLLOWS  THE  FOLLOWING  SYNTAX: 

< PRECISION? : : = < INTEGER? (,< INTEGER? ] AND  THE  TWO  INTEGER 
CONSTANTS  SPECIFY  THE  NUMBER  OF  DIGITS  AND  THE  NUM3ER  OF  FRACTIONAL 
DIGITS  FOR  THE  DATA  NAME. 


help  interim 


PURPOSE:  TO  DESCRIBE  THE  ATTRIBUTES  OF  THE  INTERIM  DATA 
NAMES  USED  IN  THE  SPECIFICATION.  SYNTAX: 
XINTERIM-STATEMENT? : : * <NAME?  IS  INTERIM 

( ( < ARGUMENT? I , < ARGUMENT? ) * ) ) WHERE- 
XARGUMENT? : : ■ (<SIZE?  (,<5I7,E?1*) 

I CHAR  ( (XLENGTH? ) j (VAP1 
I BIT  ( ( XLENGTH?) I (VAR) 

I BIN  (FIXED  | FLOAT)  [< < PREC IS  ION? ) ) 

I DEC  (FIXED  | FLOAT)  [ ( < PREC 1 5 ION ? ) ) 

I PIC  ( ^PICTURE-SPECIFICATION? ’)  WHERE- 
<SIZE?  AND  <LENGTH?  ARE  EITHER  INTEGER  CONSTANTS  OR  NAMES  THAT 
RESPECTIVELY  SPECIFY.  THE  SIZE  OR  LENGTH  OF  THE  INTERIM  NAME. 
< PICTURE-SPECIFICATION?  SPECIFIES  THE  PICTURE  ATTRIBUTES  OF  THE 
INTERIM  NAME  AND  THE  <PRECISION?  FOLLOWS  THE  FOLLOWING  SYNTAX: 

< PRECISION? : : ■ < INTEGER? ( ,< INTEGER?)  AND  THE  TWO  INTEGER 

CONSTANTS  SPECIFY  THE  NUMBER  OF  DIGITS  AND  THE  NUMBER  OF  FRACTIONAL 
DIGITS  FOR  THE  DATA  NAME. 


help  subscript 


PURPOSE:  TO  DESCRIBE  THE  ATTRIBUTES  OF  A SUBSCRIPT  NAME.  SYNTAX: 
<SUBSCRIPT-STATEMENT? : : ■ <NAME?  IS  SUBSCRIPT 
f (< PARENT-NAME? ( , ( <LOWER-BOUND? | 

(, f <UPPER-BOUND? ) [ ,<INCREMENT?1 ) ) ) ) WHERE- 
< LOWER-BOUND? , <UPPBR-BOUND?  AND  INCREMENT?  ARE 
INTEGER  CONSTANTS. 


i 
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help  asseition 


PURPOSE:  TO  SPECIFY  THE  RELATIOXSH I PS  BETWEEN  DATA  NAMES 
USED  IN  THE  MODEL  SPECIFICATION.  SYNTAX: 
<ASSERTION-STATE'1ENT) : :=  (<  ASSERT I ON -NAME > :]  ASSERTION) 

< ASSERTION): :=  <IF-CLAUSS>  <ASSERTION> 

(ELSE  <A3SERTION>  ) 

I <FOR-CLAUSE>  <ASSERTIO’l> 

I <DATA-NAME>  = <ARITH’1ETIC- EXPRESS  ION>  WHF.RE- 
< IF-CLAUSh> : : = IF  <300LEAN-EXPRESS I ON > THEN 
< FOR -CL A USE) : : * FOR  <3UESCE I PT-NANE)  = < LOWER -BOUND) 

( TO  <UPPER-BOUNl)>  ) (BY  < INCREMENT)  J LET  WHERE- 
< LOWEP-BCUND) , CUPPER-BOUND),  AND  < INTEGER)  ARE  INTEGER 
CONSTANTS;  AND  < BOOLE AN-FV PRESS  ION ) AND 

< ARITHMETIC -EX PRESS  ION)  HAS  THE  REGULAR  SYNTAX  (AS  IN  PL/1). 


INDEX 


< Operator  139 
<*  Operator  139 
+ Operator  142 
I I Operator  139 
k Operator  139 
$POPP  Procedure  410 
* Operator  140 
**  Operator  140 
■ Operator  139 
- Operator  142 
/ Operator  140 
> Operator  139 
>*  Operator  139 

*DICT  Variable  247,  253,  362,  379,  380,  381 

tOPENTRIES  entry  305 

IPRED  Vector  294,  379,  380,  381,  382 

•PREDP  Vector  379,  380,  381 

tPREDT  Vector  379,  380,  381 

IPRED8  Vector  379,  380,  381 

IPRED8P  Vector  294,  379,  380,  381 

IPRED8T  Vector  379,  380,  381 

ISUCC  Vector  379,  380,  381 
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f SUCC8  Vector  294,  379,  380,  381 
♦SUCC8F  Vector  379,  380,  381 
ISUCC8T  Vector  379,  380,  381 
- Keyword  138 
* Operator  139 
Acyclic  310 

Analysis  of  Array  Graph  7 

Analysis  of  Directed  Graph  35,  39 

Analysis  of  MODEL  Specification  4,  5,  6 

Arithmetic  Expression  100,  112,  138,  139,  160 

Arithmetic  Expression  Entry  182,  233 

Array  Graph  7,  14,  194,  306,  309-373 

Array  Graph  - Definition  309 

Assertion  Entry  180,  233 

Assertion  Name  100,  113 

Assertions  9,  24,  44,  45,  51,  57,  99,  158 
Assertions  Section  99,  400 

Associative  Memory  4,  13,  36,  38,  126,  131,  146,  147, 

203-207,  285,  287,  288,  388 
Attributes  40 
Automated  Aids  1 
Automatic  Programming  1,  27 
ABSENT  Keyword  47,  114,  425 

ACRADJN  Procedure  204,  206,  244,  246,  248,  253 
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ACRCP  Procedure  217,  218 
ACRDC  Procedure  171,  173 
ACRDFP  Procedure  171,  174 

ACRDICT  Procedure  204,  207,  209,  217,  222,  226,  234 

ACROR  Procedure  171,  173,  247 

ACRDRA  Procedure  171,  174,  175 

ACRDRS  Procedure  171,  174 

ACROUP  Procedure  205,  208,  209,  214,  217 


ACRD0 

Procedure 

171, 

172, 

217, 

224 

ACRD1 

Procedure 

171, 

173, 

226, 

227 

ACRPTR  Procedure  205,  208,  209,  210,  220,  222,  224 

ACRSTVS  Procedure  205,  208,  209,  225,  226,  231,  234,  236 

ADD  Command  199,  201 

ADJM  Procedure  178,  179,  380 

ADJNCS  Procedure  177,  178 

ADJMRPT  File  413 

ADJMRS  Procedure  177,  178 

ADJMSET  Procedure  177,  179,  247,  267 

ANY  Function  119,  292,  413 

APL  9,  26 

ARE  Keyword  138 

ASPLIT1  Procedure  347,  348,  375 

ASPLIT2  Procedure  351,  352,  375 

ASUBST  Procedure  205,  208,  209,  210,  238 
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Back-tracking  127 

Bietmann  24 

Binary  Tree  149 

Bipartite  Graph  311 

Bit  String  101,  104 

Boolean  Expression  Entry  186 

Bosyj  30 

Bottom  Part  22,  33  , 

Branch  and  bound  149 

BDL  26,  27 

BDL  Program  27 

BDL  Specification  28 

BINARY  Attribute  83,  84,  85,  90,  145 

BIT  attribute  82,  83,  84,  85,  90,  145 

BLOCKSIZE  Keyword  67,  96,  143,  163 

BNF  Specification  101 

BUILDD  Procedure  399 

BY  Keyword  100,  139 

Chang  33 

Character  string  86,  101,  104 
Character  Set  47 

Code  Generation  4,  14,  35,  36,  41,  147,  424 
Common  Data-area  154 
Compilation  4,  12,  35,  42 
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Compilers  1 

Completeness  6,  7,  26,  34  i 

Completeness  Analysis  13,  284-287,  295,  388 

Compound  edqe  309 

Compound  node  309 

Compound  Threshold  371 

Computable  Elements  347 

Consistency  6,  7,  26,  34,  332-373 

Consistency  Analysis  13,  284-286,  288,  374,  388 

Consistent  308,  325,  332,  333 

Control  Statements  9,  41 

Cross  Reference  Report  394,  399 

Cycle  free  417 

Cycle  Analysis  380 

Cycles  39,  286 

CEIL  Function  263 

CHAR_CODE  Keyword  68,  96,  143,  163 
CHARACTER  Attribute  82-84,  145 
CHEKY  Procedure  128 

i 

CHKCONT  Procedure  253,  265 
CHKSCON  Procedure  288,  374,  382 
CLINC  Procedure  383 
COBOL  29 

COMPRESS  Command  199,  201 
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CSUBLST  Procedure  287,  295,  298,  300 
CSUBRED  Procedure  287,  295,  301 
CTHRES  Procedure  382 
CYCLES  Procedure  380,  387,  388,  417 
Data  Description  Section  43,  44,  400 
Data  Description  Statements  8,  9,  24,  39,  51,  66 
Data  Flow  Language  26 
Data-area  154 
Decomposability  306 
Decomposable  323 
Decomposed  Graph  306,  340 
Decomposition  Criterion  (1)  337,  348,  353 
Decomposition  Criterion  (2)  344 
Decomposition  Criterion  (3)  363 
Definiteness  21 
Denotational  Language  32 
Derivation  Tree  179,  181,  190,  229 
Descriptive  Language  1,  2,  7 

Dictionary  4,  13,  36-38,  146-148,  154,  168,  171,  203-205 
Dictionary  Index  168,  207 
Diophantine  Equation  254,  255,  260,  267 
Directed  Graph  6,  7,  39,  193-195,  243,  252,  276,  278 
Directory  4,  36,  37 
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Directory  Entry  149 
Distribution  30 

Do-element  306,  314-316,  362,  363 
Do-loops  323 

Documentation  4,  35,  36,  40,  147,  394 
Domain  Dependent  11 
Domain  Expert  11,  12,  28 
DATA_LIST  Structure  155,  240,  241,  305 
DDL  Project  122 

DECIMAL  Attribute  83-86,  90,  145 
DELETE  Command  199 

DENSITY  Keyword  69,  72,  96,  142,  163 
DPIELD  Structure  213 
DIOPH  Procedure  265,  266,  268,  271 
DIOPHL  Procedure  268,  272 
DIRTEMP  217,  218 

DISPOSITION  Keyword  68,  78,  96,  143,  163  * 

DMEDIA  Structure  213 

DO  Keyword  9 ^ 

DSNAME  Keyword  75,  78,  96,  142,  163 
DUMP  Command  199 

DUPLUDE  Procedure  205,  209,  210,  241 
Encoding  and  Storing  Routines  126,  130 
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End  of  Statement  48 


* 


498 


End-element  306,  314,  315,  362,  363 
Error  code  114,  132,  133 
Error  Peedback  Loops  16,  18 
Error  Handling  49 

Error  Stacking  Routines  124-126,  128 
Execution  12,  42 
Execution  time  optimization  423 
Exist  name  199,  222 
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EBNP 

46, 

122,  123, 

125, 

130, 

136, 
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EBNP 

for 

MODEL  123, 

125 

ELSE  Keyword  100 

EMI  11,  12 

END  keyword  47 

END  Statement  137 

ENDPILE  Keyword  47,  114-116 

ENDGRP  32,  47,  81,  114,  US,  425 

ERROR  Pile  394,  413 

EXCPs  424 

EXIST  Keyword  60,  114,  116,  223,  249 
EXT_NANB  Keyword  69,  96,  142 
Finitenesa  20 
Ploat  Data  105 
Flowchart  Report  41 


Pot-clause  57-59,  94,  103 

For-each  Names  295,  297 

Format  of  Precedence  Matrix  Entries  176 

Formatted  MODEL  Specification  41 

Function  Statements  99,  108 

FIELD  Statement  59,  66,  82,  90,  92,  144,  158 

FILE  Statement  66,  74,  79,  158,  162 

FILLER  Keyword  292 

FIXED  Attribute  83,  85,  145 

FLOAT  Attribute  83,  85,  86 

FLOOR  Function  263 

FOR  Keyword  100,  102,  139 

FROM  Keyword  100 

FUNCTION  Keyword  109,  110,  138 

GtTVARS  Procedure  376 

Gana  388 

Gar bass i 118 

Gauss-Seidel  113 

Generality  21 

Graph  Algorithms  7 

Graph  Theory  417 

Graphic  Display  Tetmihal  27 

GCD  Function  257,  272 

GCYCLES  Procedure  382 
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208,  211, 
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GENASSI 

Procedure 

231-233 

GENQST 

Procedure 

221-224 

GENSTMT 
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211-213, 

224, 

226, 

227 

GET_AE 

Procedure 

225,  226, 

228, 

231, 

233 

GETFNAM 

Procedure 

245,  248 

GETQNM 

Procedure 

217,  219 

GEXACTD 

Procedure 

300 

GIMPA  Procedure  293 
GIMPI  Procedure  293 
GOTO  Keyword  9 

GRNGDIN  Procedure  287,  295,  296 
GROUP  Statement  66,  81,  144,  158 
GSUBNPT  Procedure  299 
GTVARPS  Procedure  376 
Header  Section  43,  44,  60,  400 
Hierarchical  Relationships  191,  192 
High  Level  Languages  1,  15,  22 
HELP  Keyword  47 

HELP  Statement  50,  113,  114,  131-134,  137 
I-element  31 

I/O  Channel  time  optimization  424 
I/O  Operations  9,  20,  41 
Incompleteness  11,  39,  40,  289 
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Inconsistency  39,  40 
Inclement  225,  227 
Indecomposable  347 

Induced  Atray  Gtaph  (Induced  Graph)  311-373 

Induced  Attay  Graph  - Definition  311 

Induced  Graph  311-373 

Initial  Statements  99,  110 

Input  Node  290 

Integer  Entry  184 

Interim  Data  Names  59,  241 

Interpretation  436,  440 

Inventory  30 

Iterations  32 

Iterative  Relation  6 

IBM  26,  31 

IF  Keyword  47,  100,  102,  139 
1 IFENAMS  Procedure  287,  295,  297 

INDEX1  entry  156,  241 
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INDEX2  entry  156,  241 
INITIAL  Keyword  110,  138 
INT_NAME  Keyword  69,  96,  142,  163 
INTERIM  Statement  66,  92,  144 
IS  Keyword  138 
ISDOS  33 
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Key  name  Prefixes  152 

Key  names  in  Directory  149,  151-153 

Keywords  in  MODEL  47,  114 

KEY  Keyword  75,  96,  142 

L Node  311,  312 

Learning  Process  19 

Length  name  199,  222 

Lexical  Analyzer  127 

Library  Maintenance  Sub-system  196 

Linked  list  149 

Long  30 

Look-ahead  Peature  127,  128 
Loose  Model  29 

Lower  bound  80,  103,  164,  225,  227 

LENGTH  Keyword  47,  114,  116,  223,  249 

LET  Keyword  100,  139 

LEX  Procedure  126,  128,  130 

LEXBUPN  Procedure  126,  128 

LEXCSTR  Procedure  129,  140 

LEXENAB  Procedure  130 

LBXREAD  Procedure  126 

LIBCR  Procedure  197,  198,  202 

LIBU  Procedure  197,  198,  200-202 

LINESIZE  Keyword  68,  96,  142 
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LIXBITR  Procedure  140 
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Machine  Instructions  15 
Malhotra  30 

Man-machine  Interface  1,  6,  9,  10,  30,  444,  445 

Map  Vector  169-171,  259-261,  264,  269-271,  323-373 

Map  Vector  - Definition  259,  323-325 

Map  Vector  Structure  171 

Martin  30 

McCracken  118 

Memory  optimization  418 

Michie  2 | | 

Mini-manual  for  MODEL  50 

Misc-data  Entry  184 

Model  completion  29 

Module  Name  41 

Module  Name  Statement  60 

Moore  School  25,  33,  122 

MACRO  Library  196 

MAPV  Procedure  376 

MAX  Function  263 

MEDIA  Statement  66,  73,  79,  88,  141,  158 
MERGSUB  Procedure  226,  227,  234,  236,  304 
MIN  Function  263 
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MIT  30 

MOD  Function  257,  258 

MODEL  (MODEL-III)  2,  6,  8,  13,  25,  31,  43,  113,  118,  290 
MODEL  Input  Format  46 
MODEL  Keyword  114,  132 
MODEL  Language  131 


MODEL  Processor 
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13, 
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118,  122, 

123, 

203, 

204, 

207, 

284, 
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394, 
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415,  417,  433 
MODEL  Programs  26 

MODEL  Specification  3,  6,  26,  94,  118,  121,  135,  175,  243, 
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Statement  Library 
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41. 

135, 
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202 

MODEL 

Statements  6,  12, 

45, 

46, 

49, 

131,  244, 

395 

MODEL- I 24 
MODEL-I I 25 

MODELOUT  File  394,  400 
MODULE  Keyword  47,  137 
MONITOR  Procedure  209,  247 
Names  in  MODEL  47 
Natural  Science  26 

Necessary  and  Sufficient  Condition  347 

Non-compu table  347 

Non-procedural  Language  2,  6,  8,  24 
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Non-procedural  Specification  2 
Non -prog rammer  10,  28,  32,  433 
Numeric  character  86 
NEW  Keyword  223 
NO_TRKS  Keyword  97,  142 
NOPAL  33 

NUMERIC  Attribute  83,  89,  90,  145 

01  edge  307 

02  Edge  307 
Offset  230 

Optimization  4,  14,  36,  417 

Optimizing  Compiler  42 

Order  Vector  368 

Output  Node  290 

OBT#P  Procedure  293,  380 

OLD  Keyword  223 

OPAL  33 

ORDERV  Vector  362,  384 

ORGANIZATION  Keyword  67,  95,  97,  143,  163 

OWL  Language  30 

Packing  426 

Parallelism  7,  26 

Partitioned  Data  set  78 

Partitioning  21,  23,  26 
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Path  326 

Picture  characters  87,  89 
Pnueli  25,  425 
Pointer  Name  199,  222 
Pointing  Relationships  191,  192 

Precedence  Matrix  4,  13,  36-38,  146,  168,  175-179,  191, 

203,  204,  244,  291 


Precedence  Relations  35,  36,  38,  147,  203 

Precedence  Relationship  191,  192,  289,  332 

Precise  model  29 

Preprocessor  29 

Prerequisite  Training  20 

Prime  256 

Problem  Representation  17,  18 
Problem  Solving  Process  16-18 
Procedural  Lanquage  2,  21 
Proceduralness  20 
Procurement  30 
Program  Generation  21 
Program  Maintenance  21,  23 
Program  Writer  30 
Programming  Semantics  10 
Prywes  22,  24 

PAGESIZE  Keyword  68,  73,  97,  142,  163 
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PARITY  Keyword  68,  72,  97,  143,  163 
PICTURE  Attribute  83,  87,  90,  91,  145,  163 
PL/1  9,  24,  30,  41,  42,  71,  77,  85,  86,  87,  89,  102,  123, 

215,  221,  424,  431-433,  439,  441,  443 
PL/l-P  433 

POINTER  Keyword  47,  114,  116,  117,  249 
PREDL  Vector  294,  279-381 
PRED8L  Vector  294,  279-381 

» 

PROTOSYSTEM- I 30,  32 
PRTADJL  Procedure  407 
PRTADJM  Procedure  407 
PRTASSX  Procedure  400 
PRTMED  Procedure  400 

PRTSTMT  Procedure  400  

PSA  34 
PSI  28 

1 PSL  34 

Qualified  Names  49 

a 

Query-By-Example  31 
R Node  311,  312 

Recognizer  Routines  124,  126,  128,  129 
Record  Pormat  70 
Record  Length  70 
Relatively  Prime  256 
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Rin  24,  25,  425 

Ruth  24,  30 

RANGSR  Procedure  294 

READ  Keyword  9 

RECEXP  Procedure  129,  130 

RECPINT  Procedure  140 

RECFLOT  Procedure  129,  140 

RECPM  Keyword  67,  97 

RECINT  Procedure  129,  140 

RECNANE  Procedure  124,  128,  140 

RECOR  Procedure  129,  130 

RECORD  Statement  66,  79,  81,  83,  158 

RECORDSIZE  Keyword  67,  97,  143,  163 

REFER  option  of  PL/I  9,  221 

REFER  Keyword  47,  137 

REFER  Statement  63,  127,  135,  137,  196,  202 
REMOVES  Procedure  288,  374,  376-378,  380 
RESIC  Procedure  211,  285,  287,  289,  305 
RESIC8  Procedure  287,  289,  293 
RETRf E Procedure  148,  164,  166 
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RICONS  Procedure  285,  288 

\ 

RICONSE  Procedure  376  » 

RICONS8  Procedure  288,  374,  375 

RSORT  Procedure  205,  209,  210,  240 

RUN-1  Example  118-121,  399,  400,  410,  413 

Scale  Factor  369 

Schedulability  7,  355,  373 

Schedulable  259,  307,  308,  317,  360,  380-383 

. 

Schedule  306,  314-325 
Schedule  - Definition  316 
Schedule  Element  316,  317 

. 

Section  Name  41 

Self  Map  Vector  328-373 

Semantic  Error  Report  394 

Semantics  10,  27 

Sequence  of  operations  9 

Sequenceability  7,  286,  306,  355 

Sequenceable  286,  306-308,  317,  332,  380-383 

Sequencing  4,  5,  14,  36,  147,  417 

Simple  Assertion  104,  138,  139 

Simple  Assertion  Entry  181,  233 

Simultaneous  Equations  99,  110-112,  160 

Single  Assignment  Property  32 

Social  Science  26 
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Software  Development  Process  3,  21-23 

Solution  of  a.x+b.y*n  256 

Solution  Method  Statements  99,  112 

Source  Data  Names  241 

Source  File  Statement  61 

Source  Listing  394,  395 

Source  Variable  Statements  99,  106 

Splitting  7,  319 

Standard  form  of  Subscripts  228 

Stanford  28 

Statement  type  114,  132,  133 
Statement  Number  40 
Status  Bits  156,  157,  240 

Storage  and  Retrieval  Sub-system  126,  131,  146,  147 

Storage  Entry  4,  36-38,  148,  150,  154,  172 

Subscript  Propagation  383 

Subscript  Redundancy  301 

Subscript  Specification  289 

Subscripts  6,  26,  102,  169 

Sufficient  Condition  360 

Supporting  Routines  124-126 

Syntactical  Errors  48 

Syntax  Analysis  4,  13,  35-37,  122,  125,  126,  146,  147 
Syntax  Error  Report  394,  410 
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Syntax  Notation  45 

System  Level  Specification  21,  23 

SAP  122,  123,  125,  132,  136 

SAPERR  File  394,  410 

SAPG  122-125,  127 

SAPOUT  Pile  394,  395 

SBA  31 


SCHEDUL  Procedure  361,  375 


SECTION  Command  199 
SECTION  Keyword  47,  137 
SETtOFE  Piocedure  305 

SETDTRP  Procedure  215,  217,  218,  220,  224,  226,  227 

SBTTYPE  Procedure  246-249,  253,  265 

SETT56  Procedure  247,  248 

SBTT9  Procedure  287,  305 

SOLUTION  Keyword  112,  138 

SOURCE  Keyword  47,  106,  107,  137,  138,  223 

SPACE  Keyword  75,  78,  97,  141,  163 

SRCHFIL  Procedure  221,  222 


SSL  31 

SSUBRNG  Procedure  300 

STARTUP I LB  Keyword  75,  97,  142 

STORE  Procedure  131,  148,  164,  165,  214,  233 

SUBPRC  Procedure  386,  387 
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SUBPROP  Procedure  288,  297,  383 

SUBPRU  Procedure  386,  387 

SUBSCRIPT  Statement  66,  93,  94 

SUBSET  Keyword  47,  114,  117,  223,  425 

SUCC8L  Vector  294,  379,  380,  381 

SUM  Punction  119 

SUPPORE  Procedure  287,  295,  304 

SURGE  29,  30 

SYNTAX  Keyword  114,  132 

SYSIN  Pile  200 

SYSPRINT  Pile  200,  394,  407 

Tabular  Language  28 

Tabular  Report  Preparation  30 

Target  Data  Names  241 

Target  Pile  Statement  62 

Target  Variable  Statements  99,  108 

Test  Statements  99,  111 

Threshold  308,  323,  360,  363-373 

Threshold  - Definition  323,  324,  332 

Tinaztepe  33 

Tolerant  8,  26 

Top  Part  22,  33 

Topological  Sorting  41,  318 

Transaction  Processing  24 
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Type-1  Relationship  191,  206 

Type-2  Relationship  191,  206 

Type-3  Relationship  192,  206 

Type-4  Relationship  192,  206 

Type-5  Relationship  192,  206 

Type-6  Relationship  192,  206 

' Type-7  Relationship  192,  206 

Type-8  Predecessor  294 
Type-8  Relationship  192,  206,  265 
Type-8  Successor  294 
Type-9  Relationship  192 
TAB  Keyword  69,  97,  141,  163 
TAPELABEL  Keyword  68,  98,  143,  163 
TARGET  Keyword  47,  108,  137,  138,  223 
TEST  Keyword  111,  138 
) TESTRP  Procedure  144 

. THEN  Keyword  100,  102 

TO  Keyword  139 

TRACKS  Keyword  69,  72,  97,  163 
Underlyinq  Graph  306,  309,  315,  317,  327 
Univ.  of  Nichiqan  33 

I 

Unpack inq  426-428 


Uppec  bound  80,  103,  164,  225,  227 
User  Reference  Manual  43 
UNDEFINEDFILE  - PL/1  on-unit  114 
UNIONL  Procedure  386 
UNIT  Keyword  67,  98,  143,  163 
UNIVAC  90/70  196 
UPOASSN  Procedure  299 
UPDINTM  Procedure  386,  387 
USC/ISI  29 

Value  Dependency  Relationships  191,  192 
Variable  Entry  185 
Variable  Name  Entry  185 

VARLIST  Structure  170,  173,  232,  234,  237,  250 

VARIABLE  Keyword  138 

VARLISTP  Pointer  170,  173,  234,  237 

VARYING  Attribute  83,  163,  164 

VHLL  Processors  24 

Weiqht  Vector  371 

Weighted  Adjacency  Matrix  175 

Weighted  Threshold  267 

WARNING  Pile  394,  415 

WERROR  Procedure  413 

WRITE  Keyword  9 

WSAPOUT  Procedure  395 


WWARN  Procedure  415 


XNLIST  Procedure  386 
XREF  Pile  394,  399 
XREPPRT  Procedure  399 
6-tuple  Representation  260 
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